The following is a dictionary of 4991 portuguese words ordered by frequency. I need the file to have just number of line and word like the following 11 lines of example. So I need to delete all lines that don't start with a number and delete everything after number and word. Also delete all lines with a pipe symbol.