Skip to main content

Basic AWK usage tutorial


“Awk" command illustration on how to process input strings and get desired output.

Example below shows how "Awk" manipulates the string to get the data needed.

Below shows the output normal output listing of  "ls -l" command.

Output of  "ls"  below has "9" columns separated by space in between the columns.

"d" = In Linux if the listing starts with "d" it shows that the filename listed at the "9th" column or the last column is the name of the directiry.

"-" = In Linux if the listing starts with "-" it indicates that the data is just a regular file or not a directory of course.

Those are commonly used naming convention or indicators to determine the type of data. There are some other indicators used such as for symbolic links and hard links.

[ttxers@farmx99 honey_store]$ ls -l
total 16
drwx------ 3 branchx branchx 4096 Apr 26  2015 lists
drwx------ 3 branchx branchx 4096 Jun 30  2015 mailers
drwx------ 4 branchx branchx 4096 Apr 26  2015 spammers
drwx------ 3 branchx branchx 4096 Apr 26  2015 xmsxx


"Awk" command that prints columns 6 to 9. awk '{print $6,$7,$8,$9}'

The above awk print command can be illustrated as: awl '{print $col_number, $col_number}'

$6, $7, $8, $9 represent the columns from the ls -l output.

The "," between $6 to $9 will provide a space between each column. Or you can replace it with any  character of your choice.

[ttxers@farmx99 honey_store]$ ls -l | awk '{print $6,$7,$8,$9}'

Sample output produce by above command:

Apr 26 2015 lists
Jun 30 2015 mailers
Apr 26 2015 spammers
Apr 26 2015 xmsxx

Sort the output by month using the sort -M command.

[ttxers@farmx99 honey_store]$ ls -l | awk '{print $6,$7,$8,$9}' | sort -M

Apr 26 2015 lists
Apr 26 2015 spammers
Apr 26 2015 xmsxx
Jun 30 2015 mailers

To replace the white space and used other character use the command below:

[ttxers@farmx99 honey_store]$ ls -l | awk '{print $6 "-" $7 "-" $8 "-" $9}'

---
Apr-26-2015-lists
Jun-30-2015-mailers
Apr-26-2015-spammers
Apr-26-2015-xmsxx

Search the web for some complex examples. Start with basic and simple example to grasp the fundamental usage.

An example to delete files using Awk and Grep command.

 ls -l | grep "A001*.eml" | awk '{print $9}' |  echo  $(xargs)

 ls -l | grep "A001*.eml" | awk '{print $9}' |  rm -f  $(xargs)


Before deleting the file use the echo command to check which files are filtered by the grep command.

The rm -f will perform a force deletion without any prompt.

To delete a single file:

 ls -l | grep "2013" | awk '{print $9}' | yes | rm  A0010002.eml

Please do a test with a dummy data before executing the command in a production environment or else data loss will be the consequence.

Deletion will cause more harm than good, if not planned carefully and the person pressing the "enter" key does not exactly know what he/she is doing.



Comments

Popular posts from this blog

WMIC get computer name

WMIC get computer model, manufacturer, computer name and  username. WMIC is a command-line tool and that can generate information about computer model, its manufacturer, its username and other informations depending on the parameters provided. Why would you need a command line tool if there’s a GUI to check? If you have 20 or 100 computers, or even more. It’s quite a big task just checking the GUI to check the computer model and username. If you have remote computers, you need to delegate someone in the remote office or location to check. Or you can just write a batch file or script to automate the task. Here’s the code below on how get computer model, manufacturer and the username. Open an elevated command prompt and type:     wmic computersystem get "Model","Manufacturer", "Name", "UserName" Just copy and paste the code above, the word “computersystem” does not need to be change to a computer name. A...

Notepad++ convert multiple lines to a single line and vice versa

Notepad++ is an awesome text editing tool, it can accept regex to process the text data. If the data is in a “.csv” format or comma separated values which is basically just a text file that can either be opened using a text editor, excel or even word. Notepad++ can process the contents of the file using regex. Example if the data has multiple rows or lines, and what is needed is to convert the whole lines of data into a single line. Notepad++ can easily do it using regex. However, if the data is on a single line and it needs to be converted into multiple lines or rows then regex can also be used for this case. Here’s an example on how to convert multiple rows or lines into a single line. Example data: Multiple rows, just a sample data. Press Ctrl+H, and  on "Find what" type: [\r\n]+ and on "Replace with" type with: , (white space) --white space is needed if need to have a space in between the data. See image below, "Regular Expression" must be se...

Print error 016-799 - Fuji Film Xerox

016-799 Fuji Xerox or Fuji Film print error code. That shows a description error as “Print instruction Fail detected in decomposer.” The error code and error description are alien languages for users and even system administrators who are not familiar with Fuji Xerox error code. The error code is quite simple and easy to fix, if the job print goes to the printer but print out doesn’t come out. So, basically the print job was received by the printer, but the printer just doesn’t know what type of paper or what size to use or which tray to utilize for the print out. In some instances, this is just a paper mismatch but the error description; if using Windows 10 to print does not exactly points to what is the issue. First thing to check, is the paper size selected by the user to print. Example, if the printer configuration is A3 and A4 sizes only. But then the person printing the file accidentally chooses “A4 Cover” then this error 016-799 will occur. ...