Word Segmentation

Word segmentation (词语分割): the process of splitting a sentence or text into individual words.

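Common definition

Word segmentation (词语分割): the process of splitting a sentence or text into individual words, commonly used in natural language processing and computational linguistics.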

Extended information

分词

word中文 ... word order 词序 word segmentation 分词 ... word method 单字法 ...

断词

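One of the difficulties in Mandarin-Taiwanese translation is Chinese word segmentation (中文断词). Words in Latin-script languages are separated by spaces, so segmentation causes no trouble there; without segmentation, Chinese text cannot …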

词切分

... 词义歧义消除（ Word Sense Disambiguation） 词切分（ Word Segmentation）统计型机器翻译（ Statistical Machine Tran…

词语切分

词干切... ... ) word stem syncopation 词干切分 ) Word segmentation 词语切分 ) word entry segmentation of Chinese 词条切分 ...

中文分词

2-5 Chinese Word Segmentation (中文分词); 2-5-1 MMSEG Chinese word segmentation system; 2-5-2 CKIP Chinese word segmentation system ...

语词切分

词干切分,word stem... ... ) tokenize 切分词 ) Word segmentation 语词切分 ) phrase segmentation 词组切分 ...

进行分词

... delimiters (such as the space character in English), so to analyze a sentence one must first perform word segmentation (进行分词).

词的切分

词的构成,Word... ... ) word formation 词的形态 ) Word segmentation 词的切分 ) Word-formation 词的构成 ...

Example sentences

The system uses an improved forward maximum matching algorithm during segmentation, which raises word segmentation efficiency.

切分过程系统利用改进正向最大匹配算法，提高了分词切分效率。
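As a rough illustration of the forward maximum matching idea mentioned in this sentence, here is a minimal Python sketch. It is the plain textbook version, not the improved algorithm the quoted paper describes. The function name `forward_max_match`, the `toy_vocab` dictionary, and the fallback to single characters for unknown text are all assumptions made for the example.

```python
def forward_max_match(text, vocab, max_len=None):
    """Greedy left-to-right segmentation: at each position, take the
    longest dictionary word that starts there."""
    if max_len is None:
        # Longest dictionary entry bounds the candidate length.
        max_len = max((len(w) for w in vocab), default=1)
    words = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, then shrink it one character at a time.
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            # Assumption: an unknown single character becomes its own token.
            if size == 1 or piece in vocab:
                words.append(piece)
                i += size
                break
    return words


# Hypothetical toy dictionary, for illustration only.
toy_vocab = {"中文", "分词", "中文分词", "是", "基础"}
print(forward_max_match("中文分词是基础", toy_vocab))
# ['中文分词', '是', '基础']
```

Because the longest match always wins, "中文分词" is kept as one word instead of being split into "中文" and "分词". The same greediness is what makes the method fast and also what makes it vulnerable to ambiguity.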

First, the word segmentation system combines various traditional word segmentation methods to produce all possible segmentation results, while also building a domain ontology knowledge base for segmentation.

首先，综合运用各种传统分词方法，提出所有可能的切分结果，同时建立切词领域本体知识库；

This paper focuses on the word boundary decision (WBD) approach to Chinese word segmentation.

该文研究和探讨一种新的分词方法：基于词边界分类的方法。

Chinese word segmentation is the basis of other Chinese information processing; search engines are only one of its applications.

中文分词是其他中文信息处理的基础，搜索引擎只是中文分词的一个应用。

The Net core uses an efficient Chinese word segmentation algorithm to analyze the database contents, index them, and save the result to the hard drive.

Net核心，通过高效的中文分词算法将数据库中内容进行分析、索引并保存至硬盘中。

Experiments on a 5,000-word test corpus show that the method achieves higher accuracy in Uyghur word segmentation.

在一个5000词的测试语料上进行了实验，实验结果表明，使用该方法进行维吾尔语词切分具有更高的准确率。

Combinational ambiguity is one of the difficult points in Chinese word segmentation.

组合型歧义切分是汉语自动分词的难点之一。

The system identifies Chinese personal names from word segmentation fragments, reaching a recognition rate of about 90% for common names.

采用分词碎片识别中文姓名法，对常见的姓名识别率达到90%左右。

Automatic identification of transliterated Western personal names is an indispensable part of automatic Chinese word segmentation.

西方姓名译名的自动识别为汉语自动分词不可或缺的组成部分。

Because of the complexity of the Chinese language, word segmentation has long been a difficult problem in Chinese natural language processing.

但由于汉语自身的复杂性，分词问题一直是中文自然语言处理的难题。

The technique of Chinese word segmentation plays an important role in many applications of Chinese information processing.

汉语自动分词在中文信息处理现实应用中占据着十分重要的位置。

The existing forward maximum matching algorithm for Chinese word segmentation is improved, and a customized Chinese analyzer is implemented.

改进现有的正向最大匹配中文分词算法，实现定制化的中文分析器；

So, for a computer to be able to process Chinese text, the text must first undergo Chinese word segmentation.

所以，要使计算机能够处理中文文本，就必须先进行中文分词。

This paper proposes a language-independent approach to text classification that requires no word segmentation.

本文提出了一种独立于语种不需分词的文本分类方法。

This paper first presents a systematic study of Chinese word segmentation, the technology underlying part-of-speech tagging.

本文首先对词性标注的基础技术——中文分词作了系统的研究。

This paper focuses on the topic crawler and Chinese word segmentation technologies of a vertical search system for networked manufacturing resources.

着重研究了网络化制造资源垂直搜索系统的主题爬虫和中文分词技术。

Every Chinese information processing system that works at the word level depends on word segmentation.

任何基于词一级的中文处理应用系统都离不开分词系统。

Word segmentation (WS) is a fundamental task in Chinese information processing.

自动分词是中文信息处理的基础课题之一。

Because it requires no Chinese word segmentation, this algorithm is suitable for processing massive amounts of Chinese text.

由于无需汉语分词，本算法适用于海量中文信息处理。

Then, after word segmentation, we introduce a Hidden Markov Model to identify most musical entities.

然后，在分词之后引入隐马尔科夫模型来识别大部分音乐实体。

Chinese word segmentation is the process of cutting a Chinese sentence into its constituent words.

所谓中文分词，就是将中文语句中的词汇切分出来的过程。

In Chinese lexical analysis, word segmentation is a necessary preliminary step for part-of-speech (POS) tagging.

在中文词法分析中，分词是词性标注必须经历的阶段。

Overlapping ambiguity is a major type of ambiguity in Chinese word segmentation.

交集型分词歧义是汉语自动分词中的主要歧义类型之一。
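To make the notion of overlapping ambiguity concrete, the following Python sketch flags places where two dictionary words share characters, so that a string ABC can be read as AB + C or A + BC. It is only an illustration under stated assumptions: the function name `overlapping_ambiguities` and the `toy_vocab` dictionary are hypothetical, and a real segmenter would use a much larger lexicon and a faster lookup structure.

```python
def overlapping_ambiguities(text, vocab):
    """Return (left_word, right_word) pairs of dictionary words whose
    occurrences in `text` partially overlap."""
    # Collect every span [start, end) that is a multi-character dictionary word.
    spans = [
        (i, j)
        for i in range(len(text))
        for j in range(i + 2, len(text) + 1)
        if text[i:j] in vocab
    ]
    found = []
    for a, b in spans:
        for c, d in spans:
            # The second word starts inside the first and ends after it.
            if a < c < b < d:
                found.append((text[a:b], text[c:d]))
    return found


# Hypothetical toy dictionary, for illustration only.
toy_vocab = {"结合", "合成", "成分", "分子"}
print(overlapping_ambiguities("结合成分子", toy_vocab))
# [('结合', '合成'), ('合成', '成分'), ('成分', '分子')]
```

Each reported pair marks a point where a segmenter has to choose between competing boundaries, for example "结合 / 成" versus "结 / 合成".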

Automatic Chinese word segmentation uses a computer to cut continuous text into a sequence of strings, each of which is one word.

中文自动分词，就是利用计算机将连续文本切分为以词为单位的字符序列。

We analyzed, designed, and implemented a Chinese word segmentation and part-of-speech tagging module based on the conditional random field (CRF) model.

分析、设计和实现了一个基于条件随机场模型的汉语分词和词性标注模块。

Chinese word segmentation is the basis of Chinese language processing.

分词是中文信息处理的基础。

Automatic Chinese word segmentation is a fundamental task in Chinese information processing and one of the bottlenecks in its development.

中文自动分词是中文信息处理领域的基础课题，也是中文信息处理发展的瓶颈之一。

This paper studies and implements a word segmentation system based on machine learning.

本文研究并实现了基于机器学习的分词系统。

In the selected texts, word bigrams are chosen as features after Chinese word segmentation, stop-word removal, and other preprocessing.

在筛选出的文本中，经过分词、去除停用词等处理后，选取二元词串作为特征；
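The pipeline described in this sentence (segment, drop stop words, take word bigrams as features) can be sketched in a few lines of Python. This is a minimal illustration, not the quoted paper's implementation: the token list is assumed to come from some segmenter, the stop-word set is a made-up example, and forming bigrams across the positions of removed stop words is an assumption.

```python
def word_bigrams(tokens, stop_words):
    """Drop stop words, then pair each remaining token with its successor."""
    kept = [t for t in tokens if t not in stop_words]
    # zip over the list and its one-step shift yields adjacent pairs.
    return list(zip(kept, kept[1:]))


# Hypothetical segmenter output and stop-word list, for illustration only.
tokens = ["中文", "分词", "是", "信息", "处理", "的", "基础"]
stop_words = {"是", "的"}
features = word_bigrams(tokens, stop_words)
print(features)
# [('中文', '分词'), ('分词', '信息'), ('信息', '处理'), ('处理', '基础')]
```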

Chinese word segmentation (CWS) is a fundamental problem in natural language processing.

中文分词是自然语言处理的基础性问题。