TOP
GOGO開學趣,參考書應有盡有
Practical Text Mining With Perl
滿額折

Practical Text Mining With Perl

商品資訊

定價
:NT$ 7197 元
優惠價
906477
若需訂購本書,請電洽客服 02-25006600[分機130、131]。
商品簡介
作者簡介
名人/編輯推薦
目次

商品簡介

Provides readers with the methods, algorithms, and means to perform text mining tasks
This book is devoted to the fundamentals of text mining using Perl, an open-source programming tool that is freely available via the Internet (www.perl.org). It covers mining ideas from several perspectives--statistics, data mining, linguistics, and information retrieval--and provides readers with the means to successfully complete text mining tasks on their own.
The book begins with an introduction to regular expressions, a text pattern methodology, and quantitative text summaries, all of which are fundamental tools of analyzing text. Then, it builds upon this foundation to explore:
Probability and texts, including the bag-of-words model
Information retrieval techniques such as the TF-IDF similarity measure
Concordance lines and corpus linguistics
Multivariate techniques such as correlation, principal components analysis, and clustering
Perl modules, German, and permutation tests
Each chapter is devoted to a single key topic, and the author carefully and thoughtfully introduces mathematical concepts as they arise, allowing readers to learn as they go without having to refer to additional books. The inclusion of numerous exercises and worked-out examples further complements the book's student-friendly format.
Practical Text Mining with Perl is ideal as a textbook for undergraduate and graduate courses in text mining and as a reference for a variety of professionals who are interested in extracting information from text documents.

作者簡介

Roger Bilisoly, PhD, is an Assistant Professor of Statistics at Central Connecticut State University, where he developed and teaches a new graduate-level course in text mining for the school's data mining program.

名人/編輯推薦

"Practical Text Mining with Perl is an excellent book for readers at a variety of different programming skill levels … Bilisoly's book would serve as a good text for an introductory text mining course, and could be supplemented with lecture notes for Web mining or data mining courses." (Journal of Statistical Software, January 2009)

目次

List of Figures.
List of Tables.
Preface.
Acknowledgments.
1. Introduction.
1.1 Overview of this Book.
1.2 Text Mining and Related Fields.
1.3 Advice for Reading this Book.
2. Text Patterns.
2.1 Introduction.
2.2 Regular Expressions.
2.3 Finding Words in a Text.
2.4 Decomposing Poe's "The Tell-Tale Heart" into Words.
2.5 A Simple Concordance.
2.6 First Attempt at Extracting Sentences.
2.7 Regex Odds and Ends.
2.8 References.
3. Quantitative Text Summaries.
3.1 Introduction.
3.2 Scalars, Interpolation, and Context in Perl.
3.3 Arrays and Context in Perl.
3.4 Word Lengths in Poe's "The Tell-Tale Heart".
3.5 Arrays and Functions.
3.6 Hashes.
3.7 Two Text Applications.
3.8 Complex Data Structures.
3.9 References.
3.10 First Transition.
4. Probability and Text Sampling.
4.1 Introduction.
4.2 Probability.
4.3 Conditioned Probability.
4.4 Mean and Variance of random Variables.
4.5 The Bag-of-Words Model for Poe's :The Black Cat".
4.6 The Effect of Sample Size.
4.7 References.
5. Applying Information Retrieval to Text Mining.
5.1 Introduction.
5.2 Counting Letters and Words.
5.3 Text Counts and Vectors.
5.4 The Term-Document Matrix Applied to Poe.
5.5 Matrix Multiplication.
5.6 Functions of Counts.
5.7 Document Similarity.
5.8 References.
6. Concordance Lines and Corpus Linguistics.
6.1 Introduction.
6.2 Sampling.
6.3 Corpus as Baseline.
6.4 Concordancing.
6.5 Collocations and Concordance Lines.
6.6 Applications with References.
6.7 Second Transition.
7. Multivariate Techniques with Text.
7.1 Introduction.
7.2 Basic Statistics.
7.3 Basic Linear Algebra.
7.4 Principal Component Matrices.
7.5 Text Applications.
7.6 Applications and References.
8. Text Clustering.
8.1 Introduction.
8.2 Clustering.
8.3 A Note on Classification.
8.4 References.
8.5 Last Transition.
9. A Sample of Additional Topics.
9.1 Introduction.
9.2 Perl Modules.
9.3 Other Languages: Analyzing Goethe in German.
9.4 Permutation Tests.
9.5 References.
Appendix A. Overview of Perl for Text Mining.
A.1 Basic Data Structures.
A.2 Operators.
A.3 Branching and Looping.
A.4 A Few Functions.
A.5 Introduction to Regular Expressions.
Appendix B. Summary of R used in this Book
B.1 Basics of R.
B.2 This Book's R Code..
References.
Index.

購物須知

外文書商品之書封,為出版社提供之樣本。實際出貨商品,以出版社所提供之現有版本為主。部份書籍,因出版社供應狀況特殊,匯率將依實際狀況做調整。

無庫存之商品,在您完成訂單程序之後,將以空運的方式為你下單調貨。為了縮短等待的時間,建議您將外文書與其他商品分開下單,以獲得最快的取貨速度,平均調貨時間為1~2個月。

為了保護您的權益,「三民網路書店」提供會員七日商品鑑賞期(收到商品為起始日)。

若要辦理退貨,請在商品鑑賞期內寄回,且商品必須是全新狀態與完整包裝(商品、附件、發票、隨貨贈品等)否則恕不接受退貨。

優惠價:90 6477
若需訂購本書,請電洽客服 02-25006600[分機130、131]。

暢銷榜

客服中心

收藏

會員專區