Category Archives: linux

Unicode block names in regular expressions

Frequently, I find myself wanting to do some simple language detection. For Chinese, Japanese, and Korean, this can easily be done by looking at the types of characters in some text. The simplest and most robust way to do this is to use Unicode block names. It is very simple to write a regular expression […]
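
As a rough illustration of the idea, here is a minimal Python sketch. Python's standard `re` module does not support Unicode block names such as `\p{InHiragana}`, so this sketch swaps in explicit code point ranges for a few blocks instead. The function name `guess_cjk_language` and the choice of blocks are assumptions made for illustration, not the approach from the full post.

```python
import re

# Code point ranges standing in for Unicode block names (an assumption:
# these few blocks are enough for a rough guess, not full coverage).
KANA = re.compile(r"[\u3040-\u309F\u30A0-\u30FF]")    # Hiragana, Katakana
HANGUL = re.compile(r"[\u1100-\u11FF\uAC00-\uD7AF]")  # Hangul Jamo, Hangul Syllables
CJK_IDEOGRAPHS = re.compile(r"[\u4E00-\u9FFF]")       # CJK Unified Ideographs


def guess_cjk_language(text):
    """Return 'ja', 'ko', 'zh', or None depending on which blocks appear."""
    # Kana is checked first because Japanese text usually mixes kana
    # with ideographs, while Chinese text contains no kana.
    if KANA.search(text):
        return "ja"
    if HANGUL.search(text):
        return "ko"
    if CJK_IDEOGRAPHS.search(text):
        return "zh"
    return None
```

In a regex flavor that does support block names (Java and Perl, for example), the character classes above could be written with the block names directly.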

Posted in bash, java, perl, python, regex

Pretty printing json

Here is a really simple way to pretty-print some unformatted JSON:

    $ echo '{"foo": "lorem", "bar": "ipsum"}' | python -m json.tool
    {
        "bar": "ipsum",
        "foo": "lorem"
    }
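
The same result can be had from inside a Python script. This is a minimal sketch using the standard `json` module; the helper name `pretty_print_json` is made up for illustration.

```python
import json


def pretty_print_json(raw):
    """Parse a JSON string and return it re-serialized with indentation."""
    parsed = json.loads(raw)
    # sort_keys=True mirrors the alphabetical key order in the output above.
    return json.dumps(parsed, indent=4, sort_keys=True)


print(pretty_print_json('{"foo": "lorem", "bar": "ipsum"}'))
```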

Posted in bash, python

Using awk to sum rows of numbers

I have a script which takes a tab-delimited file for regression tests and converts it to XML. I want to do a sanity check to make sure that the number of utterances in my XML files matches the number in the tab-delimited .txt file. I can do this in 2 lines in UNIX robert_felty$ wc -l samples2.txt […]
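
A minimal Python sketch of the same kind of check, assuming the per-file counts arrive as lines whose first field is a number (as in `wc -l` output). The function name `sum_first_field` and the sample data are hypothetical, and this is not the awk command from the full post.

```python
def sum_first_field(lines):
    """Sum the leading numeric field of each non-empty line."""
    total = 0
    for line in lines:
        fields = line.split()
        if fields:  # skip blank lines
            total += int(fields[0])
    return total


# Hypothetical per-file counts in the style of `wc -l` output.
sample = ["12 a.xml", "30 b.xml", "8 c.xml"]
print(sum_first_field(sample))
```

The total can then be compared against the line count of the tab-delimited file.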

Posted in bash, linux, UNIX

Vetting vignetting

We recently got some family portraits taken at JCPenney. I think they turned out very nicely. They had a bunch of different effects that they could apply. While we were at the studio, I really liked the way that the vignetting effect brought out our faces. However, after investigating the full set (we bought the […]

Posted in linux, photography | 2 Comments

UNIX/Linux permissions and groups – getent

I keep forgetting this command, so I am writing it here so I know where to find it. getent will list information about users and groups on a UNIX/Linux system, including NIS and LDAP users, which is crucial on networks with multiple nodes. For example, to list information about a user named robert_felty, you can do: $ […]
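
For scripts, a similar lookup can be sketched with Python's standard `pwd` and `grp` modules (Unix only). On most Linux systems these calls go through the C library's name service lookups, much as getent does, but that is an assumption worth checking on your own setup. The helper name `describe_user` is hypothetical.

```python
import grp
import pwd


def describe_user(username):
    """Return (passwd entry, group names listing the user), or None if unknown."""
    try:
        entry = pwd.getpwnam(username)
    except KeyError:
        # No such user in any configured database.
        return None
    # gr_mem lists explicitly named members only; the user's primary
    # group (entry.pw_gid) may not appear here.
    groups = [g.gr_name for g in grp.getgrall() if username in g.gr_mem]
    return entry, groups
```

Calling `describe_user("robert_felty")` would return the passwd entry and group names if that user exists on the system. Note that some directory services do not allow enumerating all groups, so the group list may be incomplete.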

Posted in linux