Commands by dopeman (11)

Remove duplicate rows of an un-sorted file based on a single column

The command below removes duplicate rows based on the FIRST column of data in an unsorted file, keeping the first occurrence of each value. In awk, '$1' refers to the first field (column) of the current row. Change both instances of '$1' in the command to remove duplicates based on a different column, for instance the third:

awk '{ if ($3 in stored_lines) x=1; else print; stored_lines[$3]=1 }' infile.txt > outfile.txt

Or change it to '$0' to base the removal on the whole row:

awk '{ if ($0 in stored_lines) x=1; else print; stored_lines[$0]=1 }' infile.txt > outfile.txt

** Note: awk keeps every distinct key in memory, so I wouldn't use this on a MASSIVE file unless you're RAM-rich ;) **
4

awk '{ if ($1 in stored_lines) x=1; else print; stored_lines[$1]=1 }' infile.txt > outfile.txt
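
The same first-occurrence filter is often written with awk's shorter '!seen[key]++' idiom. The sketch below is not part of the original post: the function name dedupe_by_column and the sample data are made up for illustration, and the column number is passed in as an argument instead of being edited into the program.

```shell
# Hypothetical helper (not from the original post): keep the first row seen
# for each distinct value in column $1 of file $2. Column 0 means the whole row.
dedupe_by_column() {
    col=$1
    file=$2
    # seen[key]++ evaluates to 0 (false) the first time a key appears, so
    # !seen[key]++ is true only for the first occurrence, and awk's default
    # action prints that row.
    awk -v col="$col" '!seen[$col]++' "$file"
}

# Example run on a small unsorted sample file.
sample=$(mktemp)
printf '%s\n' 'b 1' 'a 2' 'b 3' 'c 4' 'a 5' > "$sample"
dedupe_by_column 1 "$sample"
rm -f "$sample"
```

On the sample data this prints the rows 'b 1', 'a 2' and 'c 4'. The memory caveat is unchanged, since every distinct key is still stored in the array.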

dopeman · 2010-12-15 17:08:47 5
Backup a file with a date-time stamp

Usage: buf myfile.txt

This is useful when you are making small but frequent changes to a file. It keeps things organised and makes it clear to another administrator what changed and at what time. An overview of changes can be deduced using a simple: ls -ltr
1

buf () { filename=$1; filetime=$(date +%Y%m%d_%H%M%S); cp "${filename}" "${filename}_${filetime}"; }
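
A slightly more defensive variant might look like the sketch below. The existence check and the 'cp -p' flag are additions that are not in the original one-liner. They are there so that a mistyped name fails with a clear message and the backup keeps the original's modification time, which makes the 'ls -ltr' overview more meaningful.

```shell
# Variant of buf. The file check and the -p flag are assumptions added for
# illustration and are not part of the original function.
buf() {
    filename=$1
    # Refuse to run on a missing or non-regular file.
    if [ ! -f "$filename" ]; then
        echo "buf: '$filename' is not a regular file" >&2
        return 1
    fi
    filetime=$(date +%Y%m%d_%H%M%S)
    # -p preserves the original's mode and modification time on the copy;
    # quoting keeps file names containing spaces intact.
    cp -p -- "$filename" "${filename}_${filetime}"
}
```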

dopeman · 2010-12-14 13:19:52 6
easily find megabyte eating files or directories

Essentially the same as funky's alias, but it will not traverse filesystems and has nicer formatting.

You could add this to your profile in a multi-line format to make it look less daunting:
```
$ alias dush="du -xsm * | \
              sort -n | \
              awk '{ \
                printf(\"%4s MB  ./\",\$1); \
                for ( i=1 ; i<=NF ; i++ ) { \
                  if ( i > 1 ) printf(\"%s \",\$i) \
                }; \
                printf(\"\n\") \
              }' | \
              tail"
```
-1

alias dush="du -xsm * | sort -n | awk '{ printf(\"%4s MB ./\",\$1) ; for (i=1;i<=NF;i++) { if (i>1) printf(\"%s \",\$i) } ; printf(\"\n\") }' | tail"
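
Most of the backslashes in the alias exist only because the awk program sits inside a double-quoted string. As a rough sketch (not the author's version), the same pipeline can be written as a shell function, which needs no escaping. It assumes a du that supports -x and -m and separates size and name with a tab, as GNU du does. Splitting on that tab keeps names containing spaces in one piece.

```shell
# Function form of the dush alias (a sketch, assuming GNU-style du output).
dush() {
    # -x: stay on one filesystem, -s: one total per argument, -m: sizes in MB.
    du -xsm -- * |
        sort -n |
        # Field 1 is the size, field 2 is the whole name (tab-separated).
        awk -F'\t' '{ printf "%4s MB  ./%s\n", $1, $2 }' |
        tail
}
```

Run it from the directory you want to inspect. As with the alias, the largest entries appear at the bottom.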

dopeman · 2010-07-15 10:38:27 4
List open files that have no links to them on the filesystem

I have come across a situation in the past where someone had unlinked a file by running 'rm' against it while it was still being written to by a running process. The problem manifested itself when 'df' showed a filesystem at 100%, but this did not match the total reported by 'du -sk *'. When this happens, the process continues to write to the file but you can no longer see the file on the filesystem.

Stopping and starting the process will, more often than not, get rid of the unlinked file; however, this is not always possible on a live server. In that situation you can use the 'lsof' command below to get the PID of the process that holds the file open (in the sample output this is 23521) and its file descriptor number (the FD column, here 3). Run the following command to see a sym-link to the file (marked as deleted):

cd /proc/23521/fd && ls -l

Truncate the file through the sym-link to regain your disk space:

> /proc/23521/fd/3

I should point out that this is pretty brutal and *could* potentially destabilise your system, depending on which process owns the file you are truncating.
This is sample output - yours may be different.
```
COMMAND     PID   USER   FD   TYPE DEVICE    SIZE NLINK   NODE NAME
a-proc    23521   root    3u   REG    8,1 5595979     0 353398 /some/logfile (deleted)
```
16

lsof +L1
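
To see the mechanism safely before trying it on a live server, the situation can be reproduced on a throwaway file. The sketch below is Linux-only (it relies on /proc), the function name is hypothetical, and it runs in a subshell so the extra file descriptor never touches your interactive shell.

```shell
# Linux-only sketch: recreate the "deleted but still open" situation on a
# temporary file, then reclaim its space through /proc.
demo_truncate_unlinked() (
    [ -d /proc/self/fd ] || { echo "needs a Linux-style /proc" >&2; exit 1; }
    tmpfile=$(mktemp) || exit 1
    exec 3>>"$tmpfile"            # hold the file open for appending on fd 3
    echo "some log data" >&3      # 14 bytes including the newline
    rm -f "$tmpfile"              # unlink it; the data stays while fd 3 is open
    echo "before: $(wc -c < /proc/self/fd/3)"
    : > /proc/self/fd/3           # truncate the file through the /proc sym-link
    echo "after: $(wc -c < /proc/self/fd/3)"
    exec 3>&-                     # closing the descriptor releases the inode
)

demo_truncate_unlinked
```

The demo opens the file in append mode so that writes after the truncation would start again at offset zero. A real process that did not open its file for appending will keep writing at its old offset.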

dopeman · 2010-07-14 17:21:01 6
Search big files with long lines

This is a handy way to circumvent the "Maximum line length of 2048 exceeded" grep error. Once you have run the command below (or put it in your .bashrc), files can be searched using:

lgrep search-string /file/to/search

The search string is treated as an awk regular expression.
1

lgrep() { string=$1; file=$2; awk -v String="${string}" '$0 ~ String' "${file}"; }
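
Because lgrep matches with '~', characters such as '.' or '[' in the search string act as regular-expression operators. If a literal match is wanted, one possible companion is sketched below. The name lgrep_fixed and the environment variable needle are made up for this example.

```shell
# Hypothetical companion to lgrep: match the search string literally.
lgrep_fixed() {
    # The string is passed through the environment because, unlike 'awk -v',
    # ENVIRON does not interpret backslash escapes. index() returns 0 (false)
    # when the string is absent, so only lines containing it are printed.
    needle=$1 awk 'index($0, ENVIRON["needle"])' "$2"
}
```

Usage mirrors lgrep: lgrep_fixed 'search.string' /file/to/search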

dopeman · 2010-01-19 09:42:19 3
Search for an active process without catching the search-process

This does the same thing as many of the 'grep'-based alternatives but allows finer control over the output. For example, if you only wanted the process ID, you could change the command to:

ps -ef | awk '/mingetty/ && !/awk/ {print $2}'

If you wanted to kill the returned PIDs:

ps -ef | awk '/mingetty/ && !/awk/ {print $2}' | xargs -I{} kill {}
This is sample output - yours may be different.
```
$ ps -ef | awk '/mingetty/ && !/awk/ {print}'
root      4976     1  0 Jul01 tty1     00:00:00 /sbin/mingetty --noclear tty1
root      4977     1  0 Jul01 tty2     00:00:00 /sbin/mingetty tty2
root      4978     1  0 Jul01 tty3     00:00:00 /sbin/mingetty tty3
root      4979     1  0 Jul01 tty4     00:00:00 /sbin/mingetty tty4
root      4980     1  0 Jul01 tty5     00:00:00 /sbin/mingetty tty5
root      4981     1  0 Jul01 tty6     00:00:00 /sbin/mingetty tty6
```
1

ps -ef | awk '/process-name/ && !/awk/ {print}'
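
If you use this often, the pattern can be wrapped in a small function so the process name becomes an argument. This is only a sketch and the name psfind is hypothetical. Like the original, it drops every line that mentions awk, so it cannot be used to find awk processes themselves.

```shell
# Hypothetical wrapper: print the PIDs of processes whose 'ps -ef' line
# matches the given awk regular expression, skipping the awk search itself.
psfind() {
    # awk's own command line contains the pattern (via -v name=...), so the
    # !/awk/ test is what keeps the search process out of the results.
    ps -ef | awk -v name="$1" '$0 ~ name && !/awk/ { print $2 }'
}
```

For example, 'psfind mingetty' prints one PID per line, ready to be piped to xargs.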

dopeman · 2009-08-19 11:22:09 3
Copy files and directories from a remote machine to the local machine

This command copies files and directories from a remote machine to the local one. Make sure you are in the local directory you want to populate with the remote files before running the command. To copy a directory and its contents, you could run:

ssh user@host "(cd /path/to/a/directory ; tar cvf - ./targetdir)" | tar xvf -

This is especially useful on *nixes that don't have 'scp' installed by default.
1

ssh user@host "(cd /path/to/remote/top/dir ; tar cvf - ./*)" | tar xvf -
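
The tar-to-tar pipe can be tried out without a remote host by running both ends locally. The helper below is a sketch for experimenting, not part of the original command, and the name tar_copy is made up. It archives '.' rather than './*', so hidden files are included as well.

```shell
# Hypothetical helper: copy the contents of directory $1 into the existing
# directory $2 using the same tar pipe, with both ends on the local machine.
tar_copy() {
    # Left side writes an archive of $1 to stdout; right side unpacks it in $2.
    (cd "$1" && tar cf - .) | (cd "$2" && tar xf -)
}
```

Replacing the left-hand subshell with the ssh invocation gives the remote form shown above.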

dopeman · 2009-03-31 13:08:45 9
Show the 20 most CPU/Memory hungry processes

This command shows the 20 processes using the most CPU (hungriest at the bottom) by sorting 'ps aux' numerically on its third column, %CPU. To see the 20 most memory-intensive processes (hungriest at the bottom), sort on the fourth column, %MEM, instead:

ps aux | sort -k4n | tail -20

Or, run both:

echo "CPU:" && ps aux | sort -k3n | tail -20 && echo "Memory:" && ps aux | sort -k4n | tail -20
3

ps aux | sort -k3n | tail -20
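
One drawback of piping everything through sort is that the 'ps' header line gets sorted away, so the columns are unlabelled. A possible way around that is sketched below. The function name top_procs and its two arguments are assumptions made for this example.

```shell
# Hypothetical helper: show the header line, then the N processes with the
# highest value in the given 'ps aux' column (3 = %CPU, 4 = %MEM).
top_procs() {
    column=$1
    count=${2:-20}
    ps aux | {
        IFS= read -r header            # consume the header line first
        printf '%s\n' "$header"
        # Numeric sort on the chosen column; the hungriest end up at the bottom.
        sort -k"${column}n" | tail -n "$count"
    }
}
```

For example, 'top_procs 3' lists the 20 biggest CPU users and 'top_procs 4 5' the 5 biggest memory users.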

dopeman · 2009-03-31 12:03:34 10
Find the 20 biggest directories on the current filesystem

This command lists the 20 biggest directories (sizes in kilobytes) under your working directory, skipping directories on other filesystems. Useful for resolving disk space issues.
7

du -xk | sort -n | tail -20

dopeman · 2009-03-30 11:37:43 8
Change all instances of a word in all files in the current directory and its sub-directories

This command will replace all instances of 'foo' with 'bar' in all files in the current working directory and any sub-directories.
-1

perl -pi -e 's/foo/bar/g' $(grep -rl foo ./*)

dopeman · 2009-03-27 17:21:35 10
Change all instances of a word in all files in the current directory

This command will replace all instances of 'foo' with 'bar' in all files in the current working directory.
-1

perl -pi -e 's/foo/bar/g' $(grep -l foo ./*)

dopeman · 2009-03-27 17:18:08 6
