This is a modified version of the OP, wrapped into a bash function. This version handles newlines and other whitespace correctly; the original has problems with the (thankfully rare) case of newlines in file names. It also allows checking an arbitrary number of directories against each other, which is handy when the directories you suspect of holding duplicates don't share a convenient common ancestor directory.
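For reference, a minimal sketch of the shape of such a function (not the author's exact code; md5sum and GNU uniq are assumed):
dupes() {
  # hash every file under the given directories, NUL-delimited so odd
  # file names survive, then show each checksum that appears more than once
  find "$@" -type f -print0 | xargs -0 md5sum | sort | uniq -w32 --all-repeated=separate
}
dupes /backups /photos/old /photos/new   # example invocation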
Executing pfiles will return a list of all descriptors utilized by the process. We are interested in the S_IFREG entries, since they usually point to files. Each such line contains the inode number of the file, which we can use to find the file name. The only drawback is that, to avoid searching from /, you have to guess where the file might be. Improvements are more than welcome; lsof was not available in my case.
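The rough shape of it, on Solaris (the PID, inode number, and search path here are only examples):
pfiles 1234 | grep S_IFREG          # list regular-file descriptors, with inode numbers
find /suspected/path -inum 65514    # map an inode number back to a file name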
Let the shell handle the repetition instead of find :)
Linux users wanting to extract text from PDF files in the current directory and its sub-directories can use this command. It requires "bash", "ps2ascii" and "par", and the PARINIT environment variable sanely set (see man par). WARNING: the file "junk.sh" will be created, run, and destroyed in the current directory, so you _must_ have sufficient rights. Edit the command if you need to avoid using the file name "junk.sh".
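A loose sketch of the same idea that avoids the temporary script, assuming ps2ascii and par are installed and PARINIT is set:
find . -name '*.pdf' -print0 | while IFS= read -r -d '' f; do ps2ascii "$f" | par > "${f%.pdf}.txt"; done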
This checks the JPEG image data and the metadata. The output should be grepped as needed: maybe a -B1 Warning for the first part, and a -E "WARNING|ERROR" for the second.
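The tools aren't named here, so this is only a guess at the shape of the two parts (jpeginfo for the image data, exiv2 for the metadata):
jpeginfo -c ./*.jpg | grep -B1 Warning
exiv2 pr ./*.jpg 2>&1 | grep -E "WARNING|ERROR"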
This command takes the files in a directory and renames them, numbering them from 1 to N. Black belt stuff. Hell of a time saver.
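A minimal sketch of the idea (the extension and the zero-padding are assumptions):
n=1; for f in ./*.jpg; do mv -- "$f" "$(printf '%03d.jpg' "$n")"; n=$((n+1)); done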
Avoids the nested 'find' commands but doesn't seem to run any faster than syssyphus's solution.
The advantage to doing it this way is that you can adjust the max depth to get more recursive results, and it runs on non-GNU systems. It also won't print trailing slashes, which are easy enough to remove but slightly annoying. You could run: # for file in `find * -maxdepth 0 -type d`;do ls -d $file;done and in the ls -d part of the command you can put in whatever parameters you want, to get things like permissions, time stamps, and ownership.
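A spelling of the same idea that also copes with spaces in directory names:
find . -maxdepth 1 -type d ! -name . -exec ls -ld {} +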
While `echo rm * | batch` might seem to work, it might still raise the load of the system, since `rm` will be _started_ when the load is low but may then run for a long time. My proposed command executes a new `rm` once every minute when the load is small. Obviously, the load could be lowered further using `ionice`, but I still think this is a useful example of a sequential batch job.
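Not the author's exact command, but a sketch of the load-aware idea using batch(1), which only starts jobs when the load average is low; one rm is queued per file, so no single job runs long:
for f in ./*; do printf 'rm -- %q\n' "$f" | batch; done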
Make a bunch of files with the same permissions, owner, group, and content as a template file (handy if you have a lot of .php, .html or similar files to create).
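A minimal sketch with GNU coreutils (template.php and the target names are placeholders):
for f in a.php b.php c.php; do
  cp template.php "$f"                   # same content
  chmod --reference=template.php "$f"    # same permissions
  chown --reference=template.php "$f"    # same owner and group
done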
If you have GNU findutils, you can get only the file name with
find /some/path -type f -printf '%f\n'
instead of
find /some/path -type f | gawk -F/ '{print $NF}'
Searched strings: passthru, shell_exec, system, phpinfo, base64_decode, chmod, mkdir, fopen, fclose, readfile. Since some of these strings may occur in normal text or in legitimate code, you will need to adjust the command or the entire regex to suit your needs.
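One possible form of the search (the web root and the --include filter are assumptions):
grep -rnwE 'passthru|shell_exec|system|phpinfo|base64_decode|chmod|mkdir|fopen|fclose|readfile' --include='*.php' /var/www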
I have found that base64 encoded webshells and the like contain lots of data but hardly any newlines due to the formatting of their payloads. Checking the "width" will not catch everything, but then again, this is a fuzzy problem that relies on broad generalizations and heuristics that are never going to be perfect. What I have done is set an arbitrary threshold (200 for example) and compare the values that are produced by this script, only displaying those above the threshold. One webshell I tested this on scored 5000+ so I know it works for at least one piece of malware.
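A sketch of that "width" heuristic: average bytes per line, per file, filtered against the arbitrary threshold (the glob and the threshold are examples):
for f in ./*.php; do awk '{ bytes += length($0) + 1 } END { if (NR) printf "%d %s\n", bytes / NR, FILENAME }' "$f"; done | awk '$1 > 200'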
The number on the far right is the ratio of comments to code, expressed as a percentage. For the rest of the Yardstick documentation, see https://github.com/calmh/yardstick/blob/master/README.md#reported-metrics
List all files greater than 10 MB. Borrowed from: http://www.tippscout.de/linux-grosze-dateien-finden_tipp_1653.html
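With GNU find, for example:
find . -type f -size +10M -exec ls -lh {} +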
Find IP addresses in all files in the /etc directory. Can be used to find any string in any directory, really.
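A sketch of the idea (the IPv4 pattern is deliberately loose):
grep -rEn '([0-9]{1,3}\.){3}[0-9]{1,3}' /etc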
git gc should be run on all git repositories every 100 commits. This will help you do so if you have many git repositories ;-)
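A hedged sketch, assuming all the repositories live under one directory:
find ~/repos -type d -name .git -prune -execdir git gc \;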
The "find $stuff -print0 | xargs -0 $command" pattern causes both find and xargs to use null-delimited paths, greatly reducing the probability of either hiccuping on even the weirdest of file/path names.
It's also not strictly necessary to add the {} at the end of the xargs command line, as it'll put the files there automatically.
Mind, in most environments, you could use find's "-exec" option to bypass xargs entirely:
find . -name '*.jpg' -o -name '*.JPG' -exec mogrify -resize 1024">" -quality 40 {} +
will use xargs-like "make sure the command line isn't too long" logic to run the mogrify command as few times as necessary (to run once per file, use a ';' instead of a '+' - just be sure to escape it properly).
The find command can do this on its own. This is a shorter, faster version; it also uses a more advanced match (it will find .Jpg etc.). find doesn't need a pipe: it can run the command directly.
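For instance, the mogrify example above collapses to something like:
find . -iname '*.jpg' -exec mogrify -resize '1024>' -quality 40 {} +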
This command rm's and then cvs-removes all files under the current directory, recursively.
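A guess at the shape of it (cvs remove -f deletes the file and schedules its removal in one step; the CVS bookkeeping directories are skipped):
find . -name CVS -prune -o -type f -exec cvs remove -f {} \;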
It starts in the current working directory.
It removes each empty directory and its ancestors (unless an ancestor contains elements other than the empty directory itself).
It will print a failure message for every directory that isn't empty.
This command handles correctly directory names containing single or double quotes, spaces or newlines.
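Presumably something along these lines, with rmdir -p pruning the empty ancestors:
find . -empty -type d -print0 | xargs -0 rmdir -p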
If you do not want to remove the ancestors as well, just use:
find . -empty -type d -print0 | xargs -0 rmdir