Skip to content

python2.7环境下,常见的字符编码归一化处理

Notifications You must be signed in to change notification settings

Bishoptylaor/String_Filter

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 

Repository files navigation

String_Filter

python2.7环境下,常见的字符编码归一化处理及相关小工具

 包括:常见非显示字符处理,
      标点符号的中英文互转和全半角互转
      各种utf-8类型的字符串转成标准Unicode格式。其他格式转Unicode也支持
      hash值计算
      去除网页抓取过程中得到的 \r \n \t 等无效内容

filter_base中是所需的替换结构

About

python2.7环境下,常见的字符编码归一化处理

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages