Monthly Archives: January 2019

UNIX tip of the day —
duplicate and replace lines with awk

Today I got some data I wanted to add to my machine learning training datasets for named entity recognition. My system is designed to be used with output from automatic speech recognition (ASR). It is frequently difficult to be certain whether ASR output will contain hyphens or not, e.g. (email, vs e-mail) so frequently I […]

Posted in linguistics, UNIX | Comments Off on UNIX tip of the day —
duplicate and replace lines with awk