the file format for the noun break down thing is as follows

language flag			4 bytes
the number of characters used	4 bytes	
character mapping table 	1024 bytes
number of counjunction entries	4 bytes
the conjunction entries		8 bytes each
The number of nodes in the tree	4 bytes
the node indexes		4 bytes for each node
the size of the data table	4 bytes
the data table			x bytes as specified above



the language flag specified as follows

these are the language codes inside of dectalk...

english            0x0000
french             0x0001
german             0x0002
spanish            0x0003
japanese           0x0004
british            0x0005

