Text Processing in Python

书名:Text Processing in Python
作者:DavidMertz
译者:
ISBN:9780321112545
出版社:Addison-WesleyProfessional
出版时间:2003-6-12
格式:epub/mobi/azw3/pdf
页数:544
豆瓣评分: 7.5

书籍简介:

Text Processing in Python describes techniques for manipulation of text using the Python programming language. At the broadest level, text processing is simply taking textual information and doing something with it. This might be restructuring or reformatting it, extracting smaller bits of information from it, or performing calculations that depend on the text. Text processing is arguably what most programmers spend most of their time doing. Because Python is clear, expressive, and object-oriented it is a perfect language for doing text processing, even better than Perl. As the amount of data everywhere continues to increase, this is more and more of a challenge for programmers. This book is not a tutorial on Python. It has two other goals: helping the programmer get the job done pragmatically and efficiently; and giving the reader an understanding – both theoretically and conceptually – of why what works works and what doesn't work doesn't work. Mertz provides practical pointers and tips that emphasize efficent, flexible, and maintainable approaches to the textprocessing tasks that working programmers face daily. From the Back Cover: Text Processing in Python is an example-driven, hands-on tutorial that carefully teaches programmers how to accomplish numerous text processing tasks using the Python language. Filled with concrete examples, this book provides efficient and effective solutions to specific text processing problems and practical strategies for dealing with all types of text processing challenges. Text Processing in Python begins with an introduction to text processing and contains a quick Python tutorial to get you up to speed. It then delves into essential text processing subject areas, including string operations, regular expressions, parsers and state machines, and Internet tools and techniques. Appendixes cover such important topics as data compression and Unicode. A comprehensive index and plentiful cross-referencing offer easy access to available information. In addition, exercises throughout the book provide readers with further opportunity to hone their skills either on their own or in the classroom. A companion Web site (http://gnosis.cx/TPiP) contains source code and examples from the book. Here is some of what you will find in thie book: * When do I use formal parsers to process structured and semi-structured data? Page 257 * How do I work with full text indexing? Page 199 * What patterns in text can be expressed using regular expressions? Page 204 * How do I find a URL or an email address in text? Page 228 * How do I process a report with a concrete state machine? Page 274 * How do I parse, create, and manipulate internet formats? Page 345 * How do I handle lossless and lossy compression? Page 454 * How do I find codepoints in Unicode? Page 465

作者简介:

书友短评:

@ JeffChen 这本书是2007年的时候就开始看了吧,现在全忘记了 @ ヘリ After reading,I think I should using awk or perl instead of py in text processing. Orz @ Sea 搞文本处理的可以看看 @ 闻亮 翻译中… @ Menghan 这书有点旧,开始的函数式还真有点难懂后面介绍模块的部分确实不错 @ 流镡随语 没有读完这本书,写作风格不是太喜欢,不过可以作为一个手册,当做文本处理工作时,可以翻翻相应的专题,参考参考代码 @ ヘリ After reading,I think I should using awk or perl instead of py in text processing. Orz @ simeon.chaos 我的兴趣我的收获 @ Serendipity 说实话,当时读了个开头。。 @ JeffChen 这本书是2007年的时候就开始看了吧,现在全忘记了

添加微信公众号:好书天下获取

添加微信公众号:“好书天下”获取书籍好书天下 » Text Processing in Python
分享到: 更多 (0)

评论 抢沙发

评论前必须登录!

 

添加微信公众号:“好书天下”获取书籍

添加微信公众号:“好书天下”获取书籍添加微信公众号:“好书天下”获取书籍