jieba - PyPI
文章推薦指數: 80 %
“结巴”中文分词:做最好的Python 中文分词组件. “Jieba” (Chinese for “to stutter”) Chinese text segmentation: built to be the best Python Chinese word ...
Skiptomaincontent
Switchtomobileversion
SearchPyPI
Search
jieba0.42.1
pipinstalljieba
CopyPIPinstructions
Latestversion
Released:
Jan20,2020
ChineseWordsSegmentationUtilities
Navigation
Projectdescription
Releasehistory
Downloadfiles
Projectlinks
Homepage
Statistics
GitHubstatistics:
Stars:
Forks:
Openissues/PRs:
ViewstatisticsforthisprojectviaLibraries.io,orbyusingourpublicdatasetonGoogleBigQuery
Meta
License:MITLicense(MIT)
Author:Sun,Junyi
Tags
NLP
Maintainers
fxsjy
Classifiers
IntendedAudience
Developers
License
OSIApproved::MITLicense
NaturalLanguage
Chinese(Simplified)
Chinese(Traditional)
OperatingSystem
OSIndependent
ProgrammingLanguage
Python
Python::2
Python::2.6
Python::2.7
Python::3
Python::3.2
Python::3.3
Python::3.4
Topic
TextProcessing
TextProcessing::Indexing
TextProcessing::Linguistic
Projectdescription
Projectdetails
Releasehistory
Downloadfiles
Projectdescription
jieba
“结巴”中文分词:做最好的Python中文分词组件
“Jieba”(Chinesefor“tostutter”)Chinesetextsegmentation:builtto
bethebestPythonChinesewordsegmentationmodule.
完整文档见README.md
GitHub:https://github.com/fxsjy/jieba
特点
支持三种分词模式:
精确模式,试图将句子最精确地切开,适合文本分析;
全模式,把句子中所有的可以成词的词语都扫描出来,
速度非常快,但是不能解决歧义;
搜索引擎模式,在精确模式的基础上,对长词再次切分,提高召回率,适合用于搜索引擎分词。
支持繁体分词
支持自定义词典
MIT授权协议
在线演示:http://jiebademo.ap01.aws.af.cm/
安装说明
代码对Python2/3均兼容
全自动安装:easy_installjieba或者pipinstalljieba/pip3installjieba
半自动安装:先下载https://pypi.python.org/pypi/jieba/,解压后运行
pythonsetup.pyinstall
手动安装:将jieba目录放置于当前目录或者site-packages目录
通过importjieba来引用
Projectdetails
Projectlinks
Homepage
Statistics
GitHubstatistics:
Stars:
Forks:
Openissues/PRs:
ViewstatisticsforthisprojectviaLibraries.io,orbyusingourpublicdatasetonGoogleBigQuery
Meta
License:MITLicense(MIT)
Author:Sun,Junyi
Tags
NLP
Maintainers
fxsjy
Classifiers
IntendedAudience
Developers
License
OSIApproved::MITLicense
NaturalLanguage
Chinese(Simplified)
Chinese(Traditional)
OperatingSystem
OSIndependent
ProgrammingLanguage
Python
Python::2
Python::2.6
Python::2.7
Python::3
Python::3.2
Python::3.3
Python::3.4
Topic
TextProcessing
TextProcessing::Indexing
TextProcessing::Linguistic
Releasehistory
Releasenotifications|
RSSfeed
Thisversion
0.42.1
Jan20,2020
0.42
Jan13,2020
0.41
Jan8,2020
0.40
Dec25,2019
0.39
Aug28,2017
0.38
Dec16,2015
0.37
Jun27,2015
0.36.2
Apr17,2015
0.36.1
Apr17,2015
0.36
Mar20,2015
0.35
Nov13,2014
0.34
Oct20,2014
0.33
Aug31,2014
0.32
Feb7,2014
0.31
Jul29,2013
0.30
Jul1,2013
0.29.1
Jun17,2013
0.29
Jun7,2013
0.28.4
May31,2013
0.28.3
May2,2013
0.28.2
Apr28,2013
0.28.1
Apr27,2013
0.28
Apr27,2013
0.27
Apr22,2013
0.26.1
Apr7,2013
0.26
Apr7,2013
0.25
Feb18,2013
0.24
Dec28,2012
0.23
Dec12,2012
0.22
Nov27,2012
0.21
Nov23,2012
0.20
Nov6,2012
Downloadfiles
Downloadthefileforyourplatform.Ifyou'renotsurewhichtochoose,learnmoreaboutinstallingpackages.
SourceDistribution
jieba-0.42.1.tar.gz
(19.2MB
viewhashes)
Uploaded
Jan20,2020
source
Close
Hashesforjieba-0.42.1.tar.gz
Hashesforjieba-0.42.1.tar.gz
Algorithm
Hashdigest
SHA256
055ca12f62674fafed09427f176506079bc135638a14e23e25be909131928db2
Copy
MD5
32091ceaebbd38baee82cfe88bd0003b
Copy
BLAKE2-256
c6cb18eeb235f833b726522d7ebed54f2278ce28ba9438e3135ab0278d9792a2
Copy
Close
English
español
français
日本語
português(Brasil)
українська
Ελληνικά
Deutsch
中文(简体)
中文(繁體)
русский
עברית
esperanto
Supportedby
AWS
Cloudcomputing
Datadog
Monitoring
Facebook/Instagram
PSFSponsor
Fastly
CDN
Google
ObjectStorageandDownloadAnalytics
Huawei
PSFSponsor
Microsoft
PSFSponsor
NVIDIA
PSFSponsor
Pingdom
Monitoring
Salesforce
PSFSponsor
Sentry
Errorlogging
StatusPage
Statuspage
延伸文章資訊
- 1jieba - PyPI
“结巴”中文分词:做最好的Python 中文分词组件. “Jieba” (Chinese for “to stutter”) Chinese text segmentation: built t...
- 2Python 结巴分词(jieba)使用方法文档及示例代码 - cjavapy.com
本文主要介绍Python中,结巴分词(jieba)的使用相关介绍文档,使用结巴分词(jieba)进行分词的方法,以及相关的示例代码。
- 3python jieba分詞(結巴分詞)、提取詞,載入詞,修改詞頻
轉載請註明出處"結巴"中文分詞:做最好的Python 中文分片語件,分詞模組jieba,它是python比較好用的分詞模組, 支援中文簡體,繁體分詞,還支援自定義 ...
- 4Python自然語言處理(二):使用jieba進行中文斷詞
结巴中文分词. Contribute to fxsjy/jieba development by creating an account on GitHub. ... Jieba提供了三種分詞模...
- 5fxsjy/jieba: 结巴中文分词
“结巴”中文分词:做最好的Python 中文分词组件. "Jieba" (Chinese for "to stutter") Chinese text segmentation: built t...