TOP
紅利積點抵現金,消費購書更貼心
Data Mining Methods for the Content Analyst ─ An Introduction to the Computational Analysis of Content
90折

Data Mining Methods for the Content Analyst ─ An Introduction to the Computational Analysis of Content

商品資訊

定價
:NT$ 2579 元
優惠價
902321
無庫存,下單後進貨(到貨天數約45-60天)
下單可得紅利積點 :69 點
商品簡介
作者簡介
目次

商品簡介

With continuous advancements and an increase in user popularity, data mining technologies serve as an invaluable resource for researchers across a wide range of disciplines in the humanities and social sciences. In this comprehensive guide, author and research scientist Kalev Leetaru introduces the approaches, strategies, and methodologies of current data mining techniques, offering insights for new and experienced users alike.
Designed as an instructive reference to computer-based analysis approaches, each chapter of this resource explains a set of core concepts and analytical data mining strategies, along with detailed examples and steps relating to current data mining practices. Every technique is considered with regard to context, theory of operation and methodological concerns, and focuses on the capabilities and strengths relating to these technologies. In addressing critical methodologies and approaches to automated analytical techniques, this work provides an essential overview to a broad innovative field.

作者簡介

Kalev Leetaru is Senior Research Scientist for Content Analysis at the University of Illinois Institute for Computing in Humanities, Arts, and Social Science and Center Affiliate of the National Center for Supercomputing Applications. He leads a number of large initiatives centering on the application of high performance computing to grand challenge problems using massive-scale document and data archives.

目次

Chapter 1 - Introduction


What Is Content Analysis?


Why Use Computerized Analysis Techniques?


Standalone Tools Or Integrated Suites


Transitioning From Theory To Practice


Chapter 2 - Obtaining And Preparing Data



Collecting Data From Digital Text Repositories



Are The Data Meaningful?


Using Data In Unintended Ways


Analytical Resolution


Types Of Data Sources


Finding Sources


Searching Text Collections


Sources Of Incompleteness


Licensing Restrictions And Content Blackouts


Measuring Viewership


Accuracy And Convenience Samples


Random Samples


Multimedia Content



Converting To Textual Format


Prosody


Example Data Sources



Patterns In Historical War Coverage


Competitive Intelligence


Global News Coverage


Downloading Content



Digital Content


Print Content


Preparing Content



Document Extraction


Cleaning


Post Filtering


Reforming/Reshaping


Content Proxy Extraction


Chapter 3 - Vocabulary Analysis



The Basics



Word Histograms


Readability Indexes


Normative Comparison


Non-Word Analysis


Colloquialisms: Abbreviations And Slang


Restricting The Analytical Window


Vocabulary Comparison And Evolution / Chronemics


Advanced Topics



Syllables, Rhyming, And ‘Sounds Like’


Gender And Language


Authorship Attribution


Word Morphology, Stemming, And Lemmatization


Chapter 4 – Correlation And Co-Occurrence



Understanding Correlation


Computing Word Correlations


Directionality


Concordance


Co-Occurrence And Search


Language Variation And Lexicons


Non-Co-Occurrence


Correlation With Metadata


Chapter 5 – Lexicons, Entity Extraction, And Geocoding



Lexicons



Lexicons And Categorization


Lexical Correlation


Lexicon Consistency Checks


Thesauri And Vocabulary Expanders


Named Entity Extraction



Lexicons And Processing


Applications


Geocoding, Gazetteers, And Spatial Analysis



Geocoding


Gazetteers And The Geocoding Process


Operating Under Uncertainty


Spatial Analysis


Chapter 6 – Topic Extraction



How Machines Process Text



Unstructured Text


Extracting Meaning From Text


Applications Of Topic Extraction



Comparing/Clustering Documents


Automatic Summarization


Automatic Keyword Generation


Multilingual Analysis: Topic Extraction With Multiple Languages


Chapter 7 – Sentiment Analysis



Examining Emotions



Evolution


Evaluation


Analytical Resolution: Documents vs Objects


Hand-Crafted vs Automatically-Generated Lexicons


Other Sentiment Scales


Limitations


Measuring Language Rather Than Worldview


Chapter 8 – Similarity, Categorization and Clustering



Categorization



The Vector-Space Model


Feature Selection


Feature Reduction


Learning Algorithm


Evaluating ATC Results


Benefits of ATC Over Human Categorization


Limitations of ATC


Applications of ATC


Clustering



Automated Clustering


Hierarchical Clustering


Partitional Clustering


Document Similarity



Vector Space Model


Contingency Tables


Chapter 9 – Network Analysis



Understanding Network Analysis


Network Content Analysis


Representing Network Data


Constructing the Network


Network Structure


The Triad Census


Network Evolution


Visualization and Clustering

購物須知

外文書商品之書封,為出版社提供之樣本。實際出貨商品,以出版社所提供之現有版本為主。部份書籍,因出版社供應狀況特殊,匯率將依實際狀況做調整。

無庫存之商品,在您完成訂單程序之後,將以空運的方式為你下單調貨。為了縮短等待的時間,建議您將外文書與其他商品分開下單,以獲得最快的取貨速度,平均調貨時間為1~2個月。

為了保護您的權益,「三民網路書店」提供會員七日商品鑑賞期(收到商品為起始日)。

若要辦理退貨,請在商品鑑賞期內寄回,且商品必須是全新狀態與完整包裝(商品、附件、發票、隨貨贈品等)否則恕不接受退貨。

優惠價:90 2321
無庫存,下單後進貨
(到貨天數約45-60天)

暢銷榜

客服中心

收藏

會員專區