# If you'd like to omit non-matching lines from the result; add ';d' to the end of the expression.
sed -E 's/(\b\w+\b)([ ]+\1\b)+/\1/msg;t' <<< "the little cat cat is in the hat hat hat, we like it.
the little cat cat is in the hat hat hat, we like it.
the little cat cat is in the hat hat hat, we like it.
the little cat cat is in the hat hat hat, we like it.
the the little cat cat is in the hat hat hat, we like it.
the the the little cat cat2 is in the hat hat hat2, we like it.
aa abb bc
ab bbc cd dc dc dc eg"
Please keep in mind that these code samples are automatically generated and are not guaranteed to work. If you find any syntax errors, feel free to submit a bug report. For a full regex reference for SED, please visit: https://www.gnu.org/software/sed/manual/html_node/The-_0022s_0022-Command.html