This will match only words that include chemical elements: H + Og + W + Ar + Ts but not Eton. It allows internal overlap: LaG = La + Ag, and allows the first letter or last letter to be the end/start of a symbol to allow chaining. You can easily remove the various pieces, though. A single-line (non-"extended") regex is at the bottom of the test string