Version Date History of Han Viet Chu Thich

0.5 03/24/2004 Hello World
0.51 03/25/2004 Create options to insert Han-Viet and/or links to TC entries. Create a forwarding page for the desktop link
0.60 03/26/2004 Insert Simplified Characters and Unicode
0.61 03/28/2004 Reformat the output so that multiple Simplified (Mainland)/Traditional (Taiwan) characters can be linked to one TC entry. Chinese punctuation marks are recognized and converted to their English counterparts in Han Viet annotations
0.62 04/02/2004 Insert PinYin in TC entries
0.63 04/03/2004 Option to display PinYin to accompany Han text
0.64 04/04/2004 Option to convert from Simplified -> Traditional Style and vice versa
0.64A 04/05/2004 Fix a bug so that a Chinese character that is not found in the TC dictionary can still be displayed. Also map the Chinese quotes << >> to English quotes " "
0.64B 05/19/2004 Add a few missing words ( đại, tha ) into the data base
0.64C 06/01/2004 Change the GUI design. Move the TextArea up a little bit and make it a little bigger
0.65 08/17/2004 Use 15K pairs of PhonThe (Traditional) <--> GianThe (Simplified) characters to convert Chinese text from PhonThe style to GianThe style and vice versa. The Chinese character may or may not be included in the TC dictionary. Also add additional conversions from Chinese punctuation marks to Vietnamese/English punctuation marks
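A pair-table conversion of this kind can be sketched as follows. This is only an illustration, not the program's actual code: the three pairs and the punctuation table are hypothetical samples standing in for the real 15K-pair data, and the function names are made up for this example. A real table also has to deal with one Simplified character that maps to several Traditional ones, which this sketch ignores.

```python
# Hypothetical sample data; the real program uses about 15K pairs.
TRAD_SIMP_PAIRS = [("漢", "汉"), ("語", "语"), ("學", "学")]
# Assumed sample of Chinese -> English punctuation mappings.
PUNCTUATION = {"，": ",", "。": ".", "；": ";", "：": ":", "？": "?", "！": "!"}

TO_SIMPLIFIED = str.maketrans({t: s for t, s in TRAD_SIMP_PAIRS})
TO_TRADITIONAL = str.maketrans({s: t for t, s in TRAD_SIMP_PAIRS})
PUNCT_TABLE = str.maketrans(PUNCTUATION)


def convert(text, target):
    """Convert text to 'simplified' or 'traditional' style.

    Characters that are not in the pair table pass through unchanged.
    """
    table = TO_SIMPLIFIED if target == "simplified" else TO_TRADITIONAL
    return text.translate(table)


def normalize_punctuation(text):
    """Replace Chinese punctuation marks with their English counterparts."""
    return text.translate(PUNCT_TABLE)
```

For example, convert("學漢語", "simplified") rewrites each character that has a pair and leaves everything else alone.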
0.70 10/15/2004 Make the GUI design a bit more attractive
 
1.0 10/12/2006 Improve the program after 2 years in service. Remove the Thiều Chửu Dictionary entries from the output page. If the "Cho vào Chú Thích" option is selected, TC entries will be displayed in a separate pop-up window for the character being clicked. This design avoids massive outputs resulting from a large input text. The new design also improves performance.
Also implemented is the "Auto Detect" feature. If the text consists of Simplified characters only, the program will keep the text unchanged if the user selects the "Giản Thể" option. Similarly, if the text consists of Traditional characters only, the text will be unchanged if the user selects the "Phồn Thể" option.
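One way to picture the "Auto Detect" check is the sketch below. It is an assumption about how such a check could work, not the program's code: the pair list is a tiny hypothetical sample, and detect_style and needs_conversion are illustrative names.

```python
# Hypothetical sample pairs (Traditional, Simplified).
PAIRS = [("漢", "汉"), ("語", "语"), ("學", "学")]
TRAD_ONLY = {t for t, s in PAIRS if t != s}
SIMP_ONLY = {s for t, s in PAIRS if t != s}


def detect_style(text):
    """Classify text as 'traditional', 'simplified', 'mixed' or 'neutral'."""
    has_trad = any(ch in TRAD_ONLY for ch in text)
    has_simp = any(ch in SIMP_ONLY for ch in text)
    if has_trad and has_simp:
        return "mixed"
    if has_trad:
        return "traditional"
    if has_simp:
        return "simplified"
    return "neutral"  # only characters shared by both styles


def needs_conversion(text, selected):
    """Return False when the text is already in the selected style."""
    return detect_style(text) not in (selected, "neutral")
```

With this check, a Simplified-only text is left unchanged when "simplified" is selected, and only texts in the other style (or mixed texts) are passed to the converter.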
1.1 10/18/2006 Insert the compound words associated with the Chinese character which was clicked on the annotated page, assuming that the Chu Thich option had been selected on the Main page. Compound words are only available to texts in Simplified style (as used in Mainland China). Compounds for Traditional style (as used in Taiwan) are not yet available.
1.2 10/19/2006 Fix a regression bug. The punctuation marks in the Chinese text are now recognized
Enhancement: the program will now identify words that have multiple translations (i.e. multiple HanViets or multiple PinYins). It will try to disambiguate the translations by matching the text that begins with the selected character. Only up to 4 characters, or up to the next punctuation mark, will be used to match against the compound words in the data base. If a match occurs, the PinYin and HanViet from the compounds will be displayed instead of those from Thiều Chửu. Only characters from a special list will be used for this disambiguation algorithm.
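The matching window described above can be sketched like this. The compound table, the punctuation set and the function names are hypothetical, and trying longer prefixes first is an assumption of this sketch rather than something the entry states.

```python
# Assumed punctuation set and hypothetical compound entries:
# compound -> (PinYin, HanViet)
PUNCTUATION = set("，。；：？！、,.;:?!")
COMPOUNDS = {
    "银行": ("yín háng", "ngân hàng"),
    "行人": ("xíng rén", "hành nhân"),
}


def match_window(text, index, max_len=4):
    """Return up to max_len characters starting at index,
    stopping early at the next punctuation mark."""
    end = index
    while (end < len(text) and end - index < max_len
           and text[end] not in PUNCTUATION):
        end += 1
    return text[index:end]


def disambiguate(text, index):
    """Look for a compound that starts at the selected character.

    Longer prefixes of the window are tried first (an assumption).
    Returns (PinYin, HanViet) or None when nothing matches.
    """
    window = match_window(text, index)
    for length in range(len(window), 1, -1):
        entry = COMPOUNDS.get(window[:length])
        if entry:
            return entry
    return None
```

When disambiguate returns None, the caller would fall back to the readings from the single-character dictionary entry.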
1.3 10/22/2006 (a) Include links to all Chinese characters on "Chu Thich" pages. Links are available for characters in either Simplified or Traditional style.
(b) Add enhancement so that the pop-ups are displayed in layered windows, each offset by 25 pixels from the previous one. The first pop-up is offset by 100 pixels to the right and down from the upper left corner of the screen.
(c) Add enhancement so that the pop-ups are resizable
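The layered pop-up placement in (b) amounts to a small calculation, sketched below. The function name is illustrative, and the sketch assumes the 25-pixel step applies both to the right and downward, which the entry does not state explicitly.

```python
def popup_position(n, first=100, step=25):
    """Return the (left, top) screen position of the n-th pop-up (n = 0, 1, ...).

    Assumes the 25-pixel offset applies in both directions.
    """
    offset = first + n * step
    return (offset, offset)
```

The first pop-up lands at (100, 100), the second at (125, 125), and so on.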
1.3.1 10/23/2006 (a) Put the focus on the newly created Chu Thich pop-up so that it is not hidden behind the Main page
(b) Fix 10 bugs ( mostly incorrect HanViets )
1.3.2 10/25/2006 (a) Fix 10 bugs (mostly incorrect HanViets in Thiều Chửu )
1.3.3 01/25/2007 Fix a major bug ( when the input text is in Traditional style, but the output is in Simplified style, the links for Chu Thich are still associated with the Traditional; this causes a problem for compounds - which are linked to Simplifieds).
 
1.3.4 2/13/2007 Display the Vietnamese in input text as HTML to get rid of all the question marks
1.3.5 3/24/2007 (Private Version) Allow display of all compounds that contain the Chinese character clicked by the user, not just those compounds that begin with the clicked character. Also fix several bugs in the HanViet translations (đạn v/s đàn, súc v/s súc,thúc, ủy v/s úy)
1.4.0 3/30/2007 (All Versions) Display all compounds that match the line which contains the character clicked by the user. Previous versions only matched the compounds that start with the clicked character. Beginning with this version, the clicked character can appear anywhere in the compound, as long as the compound matches the line at the character. Again, this works only for Simplified characters
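The rule "the compound matches the line at the character" can be illustrated with the sketch below. It is a hypothetical reading of that rule; the function name and the sample compounds are made up for the example.

```python
def matching_compounds(line, index, compounds):
    """Return the compounds that contain line[index] and also match
    the line when aligned on that character."""
    clicked = line[index]
    found = []
    for word in compounds:
        for pos, ch in enumerate(word):
            if ch != clicked:
                continue
            start = index - pos  # where the compound would begin in the line
            if start >= 0 and line[start:start + len(word)] == word:
                found.append(word)
                break
    return found
```

For the line 中国人民银行 with the character 银 clicked, the sample list ["银行", "人民银行", "行人", "中国银行"] yields 银行 and 人民银行: 行人 does not contain the clicked character, and 中国银行 contains it but does not match the line at that position.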
1.4.1 5/19/2007 Fix a bug in the compound data base which caused some entries to be missing
     
1.5.0 6/28/2008 Add the following entries ( new format, with examples )

吧 哪 搞 爸 镇 啦 瓶 储 弯 锐 挡 啥 晃 盯 砖 辣 溜 苗 扔 砸
乒 铀 钚 囊 陕 灶 捡 甩 哎 垮 趴 乓 憋 啪 氯 嘿 舱 愣 捂 胳
皂 啤 圳 嗦 嗯 瞅 哟 榴 咧 蹦 氧 嘟 碳 嘎 嗡 挚 茬 泵 砰 粤
啃 阎 嘀 祷 甭 攥 惦 嗨 尴 柿 烯 咚 歼 氨 拽 捅 唬 颐 揍 掰
荔 哧 挎 氢 跤 惮 铬 坞 碴 钊 簌 喳 叼 吆 嚓 杴 殡 噗 烷 嗝
锹 跺 嫣 抿 嗖 沏 瞟 哒 耷 蛐 酚 钴 踮 咔 嗬 湮 胺 噔 酯 酰
龇 苯

 

1.6.0 6/30/2010 If the headword has multiple PinYins, each compound that contains the headword will have a subindex to indicate which PinYin is being used for the compound. No subindex is provided if there is just one PinYin for the headword

The py: field of the Thiều Chửu dictionary is now aligned with that in the WenLin dictionary
2.0 7/07/2010 For the Pin-Yin disambiguation algorithm, this version will match every Chinese word in Traditional or Simplified texts, if the word has multiple Pin-Yins, against the compounds found in the WenLin Dictionary. If there are multiple Pin-Yin matches, the longest compound will be used to extract the matching Pin-Yins. These Pin-Yins will be used to select the Han-Viets if the Han-Viets can be disambiguated using their Pin-Yins.
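A minimal sketch of the longest-compound rule follows. The two tables are hypothetical samples (the real data comes from the WenLin and Thiều Chửu dictionaries), and the function names are illustrative only.

```python
# Hypothetical sample: compound -> one PinYin syllable per character.
COMPOUNDS = {
    "银行": ["yín", "háng"],
    "行人": ["xíng", "rén"],
    "人民银行": ["rén", "mín", "yín", "háng"],
}
# Hypothetical sample: character -> {PinYin: HanViet}.
HANVIET_BY_PINYIN = {"行": {"háng": "hàng", "xíng": "hành"}}


def pinyin_from_longest_compound(line, index):
    """Return the PinYin of line[index] taken from the longest compound
    that matches the line at that character, or None if none matches."""
    ch = line[index]
    best = None  # (compound length, PinYin syllable)
    for word, syllables in COMPOUNDS.items():
        for pos, c in enumerate(word):
            start = index - pos
            if c == ch and start >= 0 and line.startswith(word, start):
                if best is None or len(word) > best[0]:
                    best = (len(word), syllables[pos])
    return best[1] if best else None


def hanviet_for(line, index):
    """Select the HanViet that corresponds to the disambiguated PinYin."""
    pinyin = pinyin_from_longest_compound(line, index)
    return HANVIET_BY_PINYIN.get(line[index], {}).get(pinyin)
```

In this sketch, 行 in 人民银行 resolves to háng and therefore to the HanViet hàng, while 行 in 行人 resolves to xíng and hành.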

This version will elevate texts in Traditional style to the same level as texts in Simplified style. It will produce the matching compounds in Traditional style just as it did for texts in Simplified style.

For this version, texts cannot contain more than 5000 Chinese characters. CTHV will not process texts that exceed this limit. This limit is intended to prevent some users from hogging the CTHV program with massive texts and, as a result, shutting out other legitimate users or even crashing the system. This limit is also a necessity with the implementation of the new disambiguation algorithm, which requires a lot more memory. This limitation is in line with the design goal of CTHV as a learning tool, not as a production tool.
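A size check of this kind could look like the sketch below. It assumes that "Chinese character" means a code point in the CJK Unified Ideographs block (U+4E00 to U+9FFF); the program's actual counting rule is not stated here, and the names are illustrative.

```python
MAX_CHINESE_CHARS = 5000  # limit stated for this version


def count_chinese(text):
    """Count characters in the CJK Unified Ideographs block (an assumption)."""
    return sum(1 for ch in text if "\u4e00" <= ch <= "\u9fff")


def within_limit(text, limit=MAX_CHINESE_CHARS):
    """Return True when the text may be processed."""
    return count_chinese(text) <= limit
```

Punctuation, spaces and ASCII text do not count toward the limit in this sketch, and the limit is a parameter so that a different threshold can be passed in.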
 

2.1.0 7/9/2010 PinYins extracted by matching the sentence against the compound words (in either Simplified or Traditional style) from the WenLin dictionary will have a different color from the PinYins extracted from a Master List
 
2.2.0 7/16/2010 PinYins extracted by matching the sentence against the compound words (in either Simplified or Traditional style) from the WenLin dictionary will have a bold style (instead of a different color as in version 2.1.0)

HanViets extracted by matching the sentence against the compound words (in either Simplified or Traditional style) from the WenLin dictionary will have a bold style

2.2.1 7/17/2010 ASCII characters and punctuation marks in Chinese text are displayed in a style that uses the Arial font. This is to make them stand close to each other. Without this change, they will stand far apart (in the SimSun font)

Bug Fix: if a Chinese character is not found in WenLin, no PinYin is produced. With this version, the PinYins from a master list ( independent from TC Dictionary) will be used instead
 

2.3.0 9/2/2010 Reduce the text size limit to fix the performance problem. From now on, input texts cannot contain more than 2000 Chinese characters.