# Copyright 2002 by Eric House (fixin@peak.org).  All rights reserved.
#
# This program is free software; you can redistribute it and/or
# modify it under the terms of the GNU General Public License
# as published by the Free Software Foundation; either version 2
# of the License, or (at your option) any later version.
#
# This program is distributed in the hope that it will be useful,
# but WITHOUT ANY WARRANTY; without even the implied warranty of
# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
# GNU General Public License for more details.
#
# You should have received a copy of the GNU General Public License
# along with this program; if not, write to the Free Software
# Foundation, Inc., 59 Temple Place - Suite 330, Boston, MA 02111-1307, USA.

LANGCODE:en_US

# deal with DOS files
LANGFILTER_PRECLIP: tr -d '\r' |
LANGFILTER_POSTCLIP: | tr [a-z] [A-Z]
LANGFILTER_POSTCLIP: | grep '^[A-Z]*$'
LANGFILTER_POSTCLIP: | tr -s '\n\r' '\000\000'
LANGFILTER_POSTCLIP: | sort -z

# We can trust sort (above) to do the right thing since there's no
# high ascii.  dict2dawg.pl is much faster if I can trust that its
# input is in sorted order.
NEEDSSORT:false

LANGINFO: English dictionaries can contain words with any of the 26
LANGINFO: letters you think of as making up the alphabet: A-Z.  At
LANGINFO: this point any word in your list containing anything else
LANGINFO: will simply be excluded from the dictionary.
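
# A rough sketch of what the LANGFILTER stages above do to a word
# list, assuming they are simply joined into one shell pipeline.  The
# clip step that runs between PRECLIP and POSTCLIP is not shown, and
# words.txt is a hypothetical input file:
#
#   tr -d '\r' < words.txt | tr [a-z] [A-Z] | grep '^[A-Z]*$' \
#       | tr -s '\n\r' '\000\000' | sort -z
#
# Given the input lines "cat", "it's" and "Apple", this would emit
# APPLE and CAT as NUL-terminated strings.  "IT'S" contains a
# character outside A-Z, fails the grep, and is dropped.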

# High bit means "official".  Next 7 bits are an enum where
# English==1.  Low byte is padding
XLOC_HEADER:0x8100

2     0     {"_"}
9     1     'A'
2     3     'B'
2     3     'C'
4     2     'D'
12    1     'E'
2     4     'F'
3     2     'G'
2     4     'H'
9     1     'I'
1     8     'J'
1     5     'K'
4     1     'L'
2     3     'M'
6     1     'N'
8     1     'O'
2     3     'P'
1     10    'Q'
6     1     'R'
4     1     'S'
6     1     'T'
4     1     'U'
2     4     'V'
2     4     'W'
1     8     'X'
2     4     'Y'
1     10    'Z'
# should ignore all after the above
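#
# Worked example for the XLOC_HEADER value above, following its
# comment: 0x8100 splits into high byte 0x81 and low byte 0x00.
#   0x81 = 0x80 (the "official" high bit) | 0x01 (language enum, English==1)
#   0x00 = padding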