Pipeline For Downloading and Processing Files In Unix/Linux Environment With Perl


Why do this with Perl? Use bash instead. Below is just a sample.

#!/bin/bash

for file in foo1 foo2 foo3
do
    wget http://samedomain.com/$file.gz
    if [ -f "$file.gz" ]
    then
        ./myscript.sh "$file.gz" >> output.txt
    fi
done


Try combining the commands using &&, so that the second one runs only after the first completes successfully.

system("(nohup wget $file  && ./myscript.sh $file >> output.txt) &");


If you want parallel processing, you can do it yourself with forking, or use a built-in module to handle it for you. Try Parallel::ForkManager. You can see a bit more on its usage in How can I manage a fork pool in Perl?, but the CPAN page for the module will have the really useful info. You probably want something like this:

use strict;
use warnings;
use Parallel::ForkManager;

my $MAX_PROCESSES = 8; # 8 parallel processes max
my $pm = Parallel::ForkManager->new($MAX_PROCESSES);
my @files = ("foo1.gz", "foo2.gz", "foo3.gz"); # file names taken from the bash sample above

foreach my $file (@files) {
    # Forks and returns the pid for the child:
    my $pid = $pm->start and next;
    my $downurls = "http://somedomain.com/" . $file;
    system("wget $downurls");
    system("./myscript.sh $file >> output.txt");
    $pm->finish; # Terminates the child process
}
$pm->wait_all_children; # Wait for every child before reporting
print "All done!\n";
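If you want a child to skip processing when its download fails, you can check the return value of system(), which is 0 on success. A small sketch using the same variables as above:

    if (system("wget $downurls") == 0) {
        system("./myscript.sh $file >> output.txt");
    } else {
        warn "Download of $downurls failed\n";
    }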