Word Segmentation

【Research Note】Infrastructure of Text Analysis: Building and Evaluating the Lexicon for Taiwan Legislative Studies

An increasing number of legislative studies have applied natural language processing skills to replace human labor in identifying topics, positions and sentiments underlying legislative texts. Taiwan’s parliament is the only democratic parliament in the world that uses Chinese as the official language. Studies of Taiwan’s legislative record are critical to the development of comparative legislative studies and Chinese text analysis. However, the distribution of terms that appear in legislative records is different from the distribution of those seen in daily conversations.

Subscribe to RSS - Word Segmentation