 |  |  |  |  |
 | No. of Sessions | Topics | Readings |  |
 | 1-3 | Introduction to Text Analytics | |  |
 | | Origin of Text Mining - Understanding Text – Applications – Information Visualization - Architecture for Text Mining Applications. | Chapter 1 (R1) |  |
 | | Mathematics Background: Probability - Bayes’s Rule - Probability Distribution Sampling Distribution - Matrices | Chapter 2 (R1) |  |
 | 4-7 | Determining the vocabulary of terms | |  |
 | | Parsing unstructured text I: keywords, n-grams | |  |
 | | Markov Models and POS Tagging | Chapter 4 (R1) |  |
 | | Parsing unstructured text II: parse tree, stemming, lexicon and ontology | |  |
 | | Application - Stanfor NLP Kit | |  |
 | 8 | Treating Text as Data - Features | Chapter 6 (R2) |  |
 | | Scoring, term weighting and the vector space model | |  |
 | 9-11 | Exploratory analytics I: text clustering | |  |
 | | Mathematics Background: Clustering techniques | Chapter 16 and 17 (R2) |  |
 | | Applications of Text clustering - Document clustering | Chapter 8 (R1) |  |
 | 12-14 | Exploratory analytics II: topic modeling | |  |
 | | Mathematics Background: Matrix decompositions and latent semantic indexing and Bayesian Distribution | Chapter 18 (R2) |  |
 | | LSA, LDA and Word2Vec | Materials will be provided |  |
 | 15 | Text Summarization | Chapter 10 (R1) |  |
 | 16-18 | Predictive analytics: text classification | |  |
 | | Mathematics Background: Suprevised Learning Algorithms | Reference book 3 |  |
 | | Applications of Supervised Learning Algorithm | |  |
 | 19 | Application: Sentiment analysis (Non-Supervised Learning methods) | |  |
 | 20 | Project presentation | |  |
 |  |  |  |  |
 | Readings |  |  |  |
 | R1 | Text Mining Application Programming by Manu Konchandy |  |  |
 | R2 | Introduction to Information Retrieval by Manning Christopher D., Raghavan Prabhakar |  |  |
 | Reference |  |  |  |
 | 1 | Foundations of Statistical Natural Language Processing (The MIT Press) by Christopher Manning |  |  |
 | 2 | Thomas W. Miller, Prentice Hall, “Data and Text Mining - A Business Applications Approach”, Second impression, 2011 |  |  |
 | 3 | Data Mining and Predictive Analytics, 2nd Edition by Daniel T. Larose, Chantal D. Larose |  |  |
 | Evaluation |  |  |  |
 | 1 | Attendance and Class Participation 10% |  |  |
 | 2 | Mid Term of 25% weight |  |  |
 | 3 | Individual projects of 25% weight |  |  |
 | 4 | End-term of 40% weight |  |  |