CMP614 - TEXT MINING

Course Name Code Semester Theory
(hours/week)
Application
(hours/week)
Credit ECTS
TEXT MINING CMP614 Any Semester/Year 3 0 3 9
PrequisitesNone
Course languageEnglish
Course typeElective 
Mode of DeliveryFace-to-Face 
Learning and teaching strategiesLecture
Preparing and/or Presenting Reports
Problem Solving
Project Design/Management
 
Instructor (s)Asst. Prof. Dr. Gönenç Ercan, Prof. Dr. İlyas Çiçekli 
Course objectiveText data is the most common and vast of information digitally available today. These information sources are usually either non or semi structured. Methods for extracting information that can be processed by computer algorithms will be studied throughout this course. 
Learning outcomes
  1. After completing the course, the students
  2. ? Learn basic methods used in processing text data.
  3. ? Learn statistical topic models and their uses in text mining.
  4. ? Learn pattern based information extraction methods.
  5. ? Learn to use Graph based methods in text mining.
Course Content? Unstructured text processing methods
? Topic models and statistical models.
? Pattern based information extraction methods
? Graph theory based text mining
? Semantic Analysis.
? Apllication of Natural Language Processing.  
References1. Charu Aggarwal and Cheng Xiang Zhei, "Mining Text Data", Springer, 2012.
2. Sholom Weiss, Nitin Indurkhya and Tong Zhang, "Fundamentals of Predictive Text Mining", Springer, 2010.
3. Ronen Feldman and James Sanger, "The Text Mining Handbook", Cambridge Press, 2007.
 

Course outline weekly

WeeksTopics
Week 1Introduction to Text Mining
Week 2Basic Techniques for processing Unstructured Text
Week 3Dimensionality Reduction, Latent Semantic Analysis
Week 4Topic Models: Latent Dirichlet Allocation
Week 5Topic Models: Statistical Models
Week 6Pattern based information extraction methods
Week 7Basic Techniques for processing Semi-structured texts.
Week 8Web Site Scraping and Wrapper Induction
Week 9Graph Based Methods
Week 10Graph Based Methods (continued)
Week 11Text Information Visualization
Week 12Topic segmentation and summarization
Week 13Sentiment and Opinion Analysis
Week 14Project presentations
Week 15
Week 16Final Exam

Assesment methods

Course activitiesNumberPercentage
Attendance00
Laboratory00
Application00
Field activities00
Specific practical training00
Assignments00
Presentation00
Project150
Seminar00
Midterms00
Final exam150
Total100
Percentage of semester activities contributing grade succes150
Percentage of final exam contributing grade succes150
Total100

WORKLOAD AND ECTS CALCULATION

Activities Number Duration (hour) Total Work Load
Course Duration (x14) 14 3 42
Laboratory 0 0 0
Application000
Specific practical training000
Field activities000
Study Hours Out of Class (Preliminary work, reinforcement, ect)1410140
Presentation / Seminar Preparation000
Project16060
Homework assignment000
Midterms (Study duration)000
Final Exam (Study duration) 13030
Total Workload30103272

Matrix Of The Course Learning Outcomes Versus Program Outcomes

D.9. Key Learning OutcomesContrubition level*
12345
1. Graduates should have a mastery of computer science as described by the core of the Body of Knowledge.   X 
2. Graduates need understanding of a number of recurring themes, such as abstraction, complexity, and evolutionary change, and a set of general principles, such as sharing a common resource, security, and concurrency.    X 
3. Graduates of a computer science program need to understand how theory and practice influence each other.    X
4. Graduates need to think at multiple levels of detail and abstraction.   X  
5. Students will be able to think critically, creatively and identify problems in their research.    X
6. Graduates should have been involved in at least one substantial project.     X
7. Graduates should realize that the computing field advances at a rapid pace.     X
8. Graduates should conduct research in an ethical and responsible manner.   X  
9. Graduates should have good command of technical terms in both Turkish and English.    X
10. Graduates should understand the full range of opportunities available in computing.    X
11. Graduates should understand that computing interacts with many different domains.     X
12. Graduates should develop the knowledge acquired at master level and apply scientific methods in order to solve scientific problems.   X  

*1 Lowest, 2 Low, 3 Average, 4 High, 5 Highest