Friday, July 10, 2009

[isgnvmlx] Word list

Create an English word list after stemming, formatted in in lines (not columns). Use stemming: only the suffix that changed from the previous word is given. Or include the last common letter. This is ambiguous, but a human can probably figure it out. Putting a fully expanded head word at the beginning of each line is tricky to format: what will fit on a line?

Even trickier: global optimization of choosing short head words at the expense of white space at the end of the previous line.

How many pages does it take to fit the entire English language?

It would be nice to have a font without descenders ( g j p q y ) so that the lines may be packed closer together. Kerning.

Another optimization problem: suffixes may be calculated in runs from the following word as well as the previous. Probably not useful and excessively confusing.

Related problem: Text width

No comments :