Commands tagged parallel sorted by votes

Commands tagged parallel (43)

sorted by

Find Duplicate Files (based on size first, then MD5 hash)

If you have the fdupes command, you'll save a lot of typing. It can do recursive searches (-r,-R) and it allows you to interactively select which of the duplicate files found you wish to keep or delete.
This is sample output - yours may be different.
23

fdupes -r .

Vilemirth · 2011-02-19 17:02:30 9
run command on a group of nodes in parallel

The pee command is in the moreutils package.
This is sample output - yours may be different.
16

echo "uptime" | pee "ssh host1" "ssh host2" "ssh host3"

dooblem · 2010-08-20 11:42:40 9
run command on a group of nodes in parallel
This is sample output - yours may be different.
15

echo "uptime" | tee >(ssh host1) >(ssh host2) >(ssh host3)

gutoyr · 2010-08-20 16:22:57 8
Recompress all .gz files in current directory using bzip2 running 1 job per CPU core in parallel
This is sample output - yours may be different.
10

parallel -j+0 "zcat {} | bzip2 >{.}.bz2 && rm {}" ::: *.gz

aramosf · 2010-11-12 01:15:44 5
a function to find the fastest free DNS server

Uses GNU Parallel. Show Sample Output
This is sample output - yours may be different.
```
208.67.222.222  ;; Query time: 18 msec
208.67.220.220  ;; Query time: 18 msec
198.153.194.1   ;; Query time: 22 msec
8.8.8.8 ;; Query time: 14 msec
8.8.4.4 ;; Query time: 18 msec
156.154.70.1    ;; Query time: 27 msec
198.153.192.1   ;; Query time: 42 msec
156.154.71.1    ;; Query time: 33 msec
```
8

timeDNS() { parallel -j0 --tag dig @{} "$*" ::: 208.67.222.222 208.67.220.220 198.153.192.1 198.153.194.1 156.154.70.1 156.154.71.1 8.8.8.8 8.8.4.4 | grep Query | sort -nk5; }

unixmonkey74668 · 2015-04-26 08:22:32 31
grep (or anything else) many files with multiprocessor power

Parallel does not suffer from the risk of mixing of output that xargs suffers from. -j+0 will run as many jobs in parallel as you have cores. With parallel you only need -0 (and -print0) if your filenames contain a '\n'. Parallel is from https://savannah.nongnu.org/projects/parallel/
This is sample output - yours may be different.
5

find . -type f | parallel -j+0 grep -i foobar

unixmonkey8046 · 2010-01-30 02:08:46 17
grep (or anything else) many files with multiprocessor power

xargs -P N spawns up to N worker processes. -n 40 means each grep command gets up to 40 file names each on the command line.
This is sample output - yours may be different.
4

find . -type f -print0 | xargs -0 -P 4 -n 40 grep -i foobar

ketil · 2009-08-05 23:18:44 8
recursive search and replace old with new string, inside files

xargs deals badly with special characters (such as space, ' and "). To see the problem try this: touch important_file touch 'not important_file' ls not* | xargs rm Parallel https://savannah.nongnu.org/projects/parallel/ does not have this problem.
This is sample output - yours may be different.
4

grep -rl oldstring . | parallel sed -i -e 's/oldstring/newstring/'

unixmonkey8046 · 2010-01-28 08:44:16 6
Pull git submodules in parallel using GNU parallel

Make sure to run this command in your git toplevel directory. Modify `-j4` as you like. You can also run any arbitrary command beside `git pull` in parallel on all of your git submodules. Show Sample Output
This is sample output - yours may be different.
```
/home/wei/.vim/bundle/Lucius
Already up-to-date.
/home/wei/.vim/bundle/YankRing.vim
Already up-to-date.
/home/wei/.vim/bundle/FuzzyFinder
Already up-to-date.
/home/wei/.vim/bundle/L9
Already up-to-date.
/home/wei/.vim/bundle/IndexedSearch
Already up-to-date.
/home/wei/.vim/bundle/a.vim
Already up-to-date.
/home/wei/.vim/bundle/jslint.vim
Already up-to-date.
```
4

parallel -j4 cd {}\; pwd\; git pull :::: <(git submodule status | awk '{print $2}')

clvv · 2011-06-20 00:20:26 6
Fastest segmented parallel sync of a remote directory over ssh

Mirror a remote directory using some tricks to maximize network speed. lftp:: coolest file transfer tool ever -u: username and password (pwd is merely a placeholder if you have ~/.ssh/id_rsa) -e: execute internal lftp commands set sftp:connect-program: use some specific command instead of plain ssh ssh:: -a -x -T: disable useless things -c arcfour: use the most efficient cipher specification -o Compression=no: disable compression to save CPU mirror: copy remote dir subtree to local dir -v: be verbose (cool progress bar and speed meter, one for each file in parallel) -c: continue interrupted file transfers if possible --loop: repeat mirror until no differences found --use-pget-n=3: transfer each file with 3 independent parallel TCP connections -P 2: transfer 2 files in parallel (totalling 6 TCP connections) sftp://remotehost:22: use sftp protocol on port 22 (you can give any other port if appropriate) You can play with values for --use-pget-n and/or -P to achieve maximum speed depending on the particular network. If the files are compressible removing "-o Compression=n" can be beneficial. Better create an alias for the command. Show Sample Output
This is sample output - yours may be different.
```
Transferring file `Maleficent.mkv'
`Maleficent.mkv', got 1890451456 of 9089412898 (20%) 6.82M/s eta:19m
ooooooo...............................ooooooo...............................ooooooo............

# the bar resembles the file being transferred, with 'o' representing data already received
```
4

lftp -u user,pwd -e "set sftp:connect-program 'ssh -a -x -T -c arcfour -o Compression=no'; mirror -v -c --loop --use-pget-n=3 -P 2 /remote/dir/ /local/dir/; quit" sftp://remotehost:22

colemar · 2014-10-17 00:29:34 10
run command on a group of nodes in parallel

Parallel is from https://savannah.nongnu.org/projects/parallel/ Other examples would be: (echo foss.org.my; echo www.debian.org; echo www.freenetproject.org) | parallel traceroute seq -f %04g 0 9999 | parallel -X rm pict{}.jpg
This is sample output - yours may be different.
3

seq 1 5 | parallel ssh {}.cluster.net uptime

unixmonkey8046 · 2010-01-28 08:18:50 6
run command on a group of nodes in parallel redirecting outputs

Do the same as pssh, just in shell syntax. Put your hosts in hostlist, one per line. Command outputs are gathered in output and error directories.
This is sample output - yours may be different.
3

xargs -n1 -P100 -I{} sh -c 'ssh {} uptime >output/{} 2>error/{}' <hostlist

dooblem · 2010-08-20 11:03:11 3
port scan using parallel

It takes over 5 seconds to scan a single port on a single host using nmap time (nmap -p 80 192.168.1.1 &> /dev/null) real 0m5.109s user 0m0.102s sys 0m0.004s It took netcat about 2.5 minutes to scan port 80 on the class C time (for NUM in {1..255} ; do nc -w 1 -z -v 192.168.1.${NUM} 80 ; done &> /dev/null) real 2m28.651s user 0m0.136s sys 0m0.341s Using parallel, I am able to scan port 80 on the entire class C in under 2 seconds time (seq 1 255 | parallel -j255 'nc -w 1 -z -v 192.168.1.{} 80' &> /dev/null) real 0m1.957s user 0m0.457s sys 0m0.994s
This is sample output - yours may be different.
3

seq 1 255 | parallel -j+0 'nc -w 1 -z -v 192.168.1.{} 80'

devrick0 · 2011-06-11 14:40:51 3
Fastest Sort. Sort Faster, Max Speed

sort is way slow by default. This tells sort to use a buffer equal to half of the available free memory. It also will use multiple process for the sort equal to the number of cpus on your machine (if greater than 1). For me, it is magnitudes faster. If you put this in your bash_profile or startup file, it will be set correctly when bash is started. sort -S1 --parallel=2 <(echo) &>/dev/null && alias sortfast='sort -S$(($(sed '\''/MemF/!d;s/[^0-9]*//g'\'' /proc/meminfo)/2048)) $([ `nproc` -gt 1 ]&&echo -n --parallel=`nproc`)' Alternative echo|sort -S10M --parallel=2 &>/dev/null && alias sortfast="command sort -S$(($(sed '/MemT/!d;s/[^0-9]*//g' /proc/meminfo)/1024-200)) --parallel=$(($(command grep -c ^proc /proc/cpuinfo)*2))" Show Sample Output
This is sample output - yours may be different.
```
89/490MB        4.22 2.03 1.01 2/85 7081
[24309:24308 0:4220] 08:01:28 Mon Feb 27 [root@galileo:pts/2 +1] ~
$ sort -S400M -u -i -f --parallel=4 files.nocombined.sorted | pv > ~/files.nocombined.reallysorted                                                                                                                                                                                                                              
9.95MB 0:01:39 [ 103kB/s] [     <=>             
```
3

alias sortfast='sort -S$(($(sed '\''/MemF/!d;s/[^0-9]*//g'\'' /proc/meminfo)/2048)) $([ `nproc` -gt 1 ]&&echo -n --parallel=`nproc`)'

AskApache · 2012-02-28 01:34:58 6
Convert entire audio library in parallel

Uses parallel processing Reiteration of my earlier command https://www.commandlinefu.com/commands/view/15246/convert-entire-music-library Usage lc Old_Directory New_DIrectory Old_Format New_Format lc ~/Music ~/Music_ogg mp3 ogg Show Sample Output
This is sample output - yours may be different.
3

lc() { od="$1"; nd="$2"; of=$3; nf=$4; cp -rl "$od" "$nd"; parallel -0 "ffmpeg -i {1} -loglevel error -q:a 6 {1.}.{2} && { rm {1}; echo {1.}.{2}; }" :::: <(find "$nd" -type f -iname \*$of -print0) ::: "$nf"; }

snipertyler · 2017-03-02 17:37:34 37
Unzip 25 zip files files at once
This is sample output - yours may be different.
3

find . -maxdepth 1 -name '*.zip' -print0 | xargs -0 -I {} -P 25 unzip {}

wuseman1 · 2022-11-20 14:07:44 457
Multi-thread any command

For instance: find . -type f -name '*.wav' -print0 |xargs -0 -P 3 -n 1 flac -V8 will encode all .wav files into FLAC in parallel. Explanation of xargs flags: -P [max-procs]: Max number of invocations to run at once. Set to 0 to run all at once [potentially dangerous re: excessive RAM usage]. -n [max-args]: Max number of arguments from the list to send to each invocation. -0: Stdin is a null-terminated list. I use xargs to build parallel-processing frameworks into my scripts like the one here: http://pastebin.com/1GvcifYa
This is sample output - yours may be different.
2

xargs -P 3 -n 1 <COMMAND> < <FILE_LIST>

h3xx · 2011-07-25 22:53:32 34
Compute the average number of KB per file for each dir

Use this to find identify if dirs mostly contain large or small files. Show Sample Output
This is sample output - yours may be different.
```
yeslab.org 300.32786885245901639344
theyesmen.org 679.16801687505732016752
```
2

parallel echo -n {}"\ "\;echo '$(du -s {} | awk "{print \$1}") / $(find {} | wc -l)' \| bc -l ::: *

unixmonkey8046 · 2011-07-28 12:21:34 4
Clone starred github repos in parallel with unlimited speed, this example will clone 25 repositories in parallel at same time.
This is sample output - yours may be different.
2

GITUSER=$(whoami); curl "https://api.github.com/users/${GITUSER}/starred?per_page=1000" | grep -o 'git@[^"]*' | parallel -j 25 'git clone {}'

wuseman1 · 2022-06-27 18:58:46 408
Execute a command on multiple hosts in parallel

Ssh to host1, host2, and host3, executing on each host and saving the output in {host}.log. I don't have the 'parallel' command installed, otherwise it sounds interesting and less cryptic.
This is sample output - yours may be different.
1

for host in host1 host2 host3; do ssh -n user@$host <command> > $host.log & done; wait

cout · 2010-07-14 14:55:31 3
Backup all MySQL Databases to individual files

Backs up all databases, excluding test, mysql, performance_schema, information_schema. Requires parallel to work, install parallel on Ubuntu by running: sudo aptitude install parallel
This is sample output - yours may be different.
1

mysql -e 'show databases' -s --skip-column-names | egrep -v "^(test|mysql|performance_schema|information_schema)$" | parallel --gnu "mysqldump --routines {} > {}_daily.sql"

intel352 · 2013-07-24 15:37:58 11
Apply, in parallel, a bc expression to CSV

Define a function that applies bc, the *nix calculator, with the specified expression to all rows of the input CSV. The first column is mapped to {1}, second one to {2}, and so forth. See sample output for an example. This function uses all available cores thanks to GNU Parallel. Requires GNU Parallel Show Sample Output
This is sample output - yours may be different.
```
$ paste -d, <(seq 10) <(seq 10 -1 1) | pbc 'sqrt({1}*{2})'
3.16227766016837933199
4.24264068711928514640
4.89897948556635619639
5.29150262212918118100
5.47722557505166113456
5.47722557505166113456
5.29150262212918118100
4.89897948556635619639
4.24264068711928514640
3.16227766016837933199
```
1

pbc () { parallel -C, -k -j100% "echo '$@' | bc -l"; }

eroenj · 2014-06-02 19:08:03 7
Compute the average number of KB per file for each dir

Shorter version using --tag
This is sample output - yours may be different.
1

parallel --tag echo '$(du -s {} | awk "{print \$1}") / $(find {} | wc -l)' \| bc -l ::: *

IndustrialJoe · 2018-07-05 18:33:21 275
wget download with multiple simultaneous connections
This is sample output - yours may be different.
1

cat url.list | parallel -j 8 wget -O {#}.html {}

arthurwayne · 2018-12-22 08:14:06 33
Count the total amount of hours of your music collection

First the find command finds all files in your current directory (.). This is piped to xargs to be able to run the next shell pipeline in parallel. The xargs -P argument specifies how many processes you want to run in parallel, you can set this higher than your core count as the duration reading is mainly IO bound. The -print0 and -0 arguments of find and xargs respectively are used to easily handle files with spaces or other special characters. A subshell is executed by xargs to have a shell pipeline for each file that is found by find. This pipeline extracts the duration and converts it to a format easily parsed by awk. ffmpeg reads the file and prints a lot of information about it, grep extracts the duration line. cut and sed cut out the time information, and tr converts the last . to a : to make it easier to split by awk. awk is a specialized programming language for use in shell scripts. Here we use it to split the time elements in 4 variables and add them up. Show Sample Output
This is sample output - yours may be different.
```
1036:17687:2689.686985895
```
1

find . -print0 | xargs -0 -P 40 -n 1 sh -c 'ffmpeg -i "$1" 2>&1 | grep "Duration:" | cut -d " " -f 4 | sed "s/.$//" | tr "." ":"' - | awk -F ':' '{ sum1+=$1; sum2+=$2; sum3+=$3; sum4+=$4 } END { printf "%.0f:%.0f:%.0f.%.0f\n", sum1, sum2, sum3, sum4 }'

pingiun · 2019-03-01 20:21:48 40
1 2 >

What's this?

commandlinefu.com is the place to record those command-line gems that you return to again and again. That way others can gain from your CLI wisdom and you from theirs too. All commands can be commented on, discussed and voted up or down.

Share Your Commands

Check These Out

recursive search and replace old with new string, inside files

Using -Z with grep and -0 with xargs handles file names with spaces and special characters.

Convert CSV to JSON

Replace 'csv_file.csv' with your filename.

Clear current session history (bash)

Sort all running processes by their memory & CPU usage

you can also pipe it to "tail" command to show 10 most memory using processes.

BourneShell: Go to previous directory

cd - would return to the previous directory of your cd command. NB: previous dir is always stored in $OLDPWD variable.

Copy a file using dd and watch its progress

This is a more accurate way to watch the progress of a dd process. The $DDPID=$! is needed so that you don't get the PID of the sleep. The sleep 1 is needed because in my testing at least, if you run kill -USR1 against dd too quickly, it will kill it off instead of display the status. So you need to wait a second, probably so that it can configure itself to trap the USR1 signal.

clear the X clipboard

Clears your clipboard if xsel is installed on your machine. If your xsel is dumb, you can also use $xsel --clear --clipboard

Convert all Flac in a directory to Mp3 using maximum quality variable bitrate

sync svn working copy and remote repository (auto adding new files)

Lists the local files that are not present in the remote repository (lines beginning with ?) and add them.

Replace + Find

Stay in the loop…

Follow the Tweets.

Every new command is wrapped in a tweet and posted to Twitter. Following the stream is a great way of staying abreast of the latest commands. For the more discerning, there are Twitter accounts for commands that get a minimum of 3 and 10 votes - that way only the great commands get tweeted.

» http://twitter.com/commandlinefu
» http://twitter.com/commandlinefu3
» http://twitter.com/commandlinefu10

Subscribe to the feeds.

Use your favourite RSS aggregator to stay in touch with the latest commands. There are feeds mirroring the 3 Twitter streams as well as for virtually every other subset (users, tags, functions,…):

Subscribe to the feed for:

» all commands
» commands with 3 up-votes (commandlinefu3)
» commands with 10 up-votes (commandlinefu10)
» commands tagged parallel

Commands tagged parallel (43) the last day the last week the last month all time sorted by date votes

What's this?

Check These Out

Stay in the loop…

Commands tagged parallel (43)

sorted by