Course Detail
Course Detail
Course Description
| Course | Code | Semester | T+P (Hour) | Credit | ECTS |
|---|---|---|---|---|---|
| DATA MINING and BUSINESS INTELLIGENCE | MIS4112149 | Fall Semester | 3+0 | 3 | 5 |
| Course Program |
| Prerequisites Courses | |
| Recommended Elective Courses |
| Language of Course | English |
| Course Level | First Cycle (Bachelor's Degree) |
| Course Type | Required |
| Course Coordinator | Prof.Dr. Gökhan SİLAHTAROĞLU |
| Name of Lecturer(s) | Assist.Prof. Nesibe MANAV MUTLU |
| Assistant(s) | |
| Aim | To provide students with the ability to create a data warehouse from databases. To gain research skills on these data warehouses by using OLAP and data mining models. To get the level of knowledge to apply data mining algorithms. |
| Course Content | This course contains; Introduction to Data Mining ,Data warehouse –OLAP – ETL process – Database vs. Datawarehouse,Tools Used in Data Mining: Python, R, KNIME, pivoting.(KNIME Installation, Interface, Reading and viewing data),Data Preprocessing (Outliers, Missing data, Normalization, Data Transformation, Binning, Histogram),Association Rules (Shopping Cart Analysis),Supervised Learning, Classification And Decision Trees, Gini / Entropy Brief introduction. (Application: BEARS),Decision Trees (Application: MEDICINE and Wine / KNIME: filtering, train-test, validation, accuracy, Color Manager),Random Forest, Booststrapping, Ensemble Learning, Perfume Application + Telco Application,Regression and Logistic regression,Neural Network Model (principle, Hyperparameters).,Unsupervised Learning / Clustering (Application: Wholesale Customer Data, Fuzzy C-means clustering, Quality measures),Data Reduction, Synthetic Data generation, PCA, clustering with Scatter Plot (DBSCAN),Big Data concepts – HADOOP, spark, MongoDB, NOSQL,Project Discussion & Evaluation. |
| Course Learning Outcomes | Teaching Methods | Assessment Methods |
| 1. Will be able to produce data warehouse from database. | 10, 16, 9 | A |
| 1.1. Explains datamining. | 16, 9 | |
| 1.2. Defines Data Warehouse. | 16, 9 | |
| 1.3. Designs the Star Data Warehouse Model. | 10, 9 | |
| 1.4. Main tables association designs the data warehouse model. | 16, 9 | |
| 2. Will be able to relate Data Mining Models to each other. | 16, 9 | A, F |
| 2.1. Explains data mining models. | 16, 9 | |
| 2.2. Defines the concept of classification. | 16, 9 | |
| 2.3. Defines the concept of clustering. | 16, 9 | |
| 2.4. Defines the concept of connection analysis. | 16, 9 | |
| 3. will be able to apply the classification model. | 16, 9 | A, F |
| 3.1. Defines supervised learning. | 16, 9 | |
| 3.2. Defines the concept of class. | 9 | A |
| 3.3. sorts statistical algorithms. | 16, 9 | |
| 3.4. Applies decision trees. | 16, 9 | F |
| 3.5. Defines decision tree algorithms. | 16, 6, 9 | |
| 3.6. Defines pruning and purity values. | 9 | |
| 4. Will be able to employ the clustering model. | 14, 6, 9 | F |
| 4.1. Defines unsupervised learning. | 6, 9 | |
| 4.2. Explains the concept of clustering. | 16, 9 | |
| 4.3. Sorts clustering algorithms. | 6, 9 | F |
| 4.4. Applies K-means algorithm. | 14, 16, 9 | |
| 4.5. Defines genetic algorithms. | 10, 8, 9 | F |
| 5. Will be able employ the connection analysis model. | 14, 16, 6, 9 | A |
| 5.1. Interprets connection analysis rules. | 14, 6, 9 | |
| 5.2. Explains the concept of leverage. | 16, 9 | A |
| 5.3. Applies the relationship analysis method. | 14, 9 | |
| 5.4. Combines clustering with link analysis. | 2 | F |
| 6. Will be able to employ Data Mining Algorithms. | 2 | F |
| 6.1. Employs classification algorithms on data. | 2 | F |
| 6.2. Employs clustering algorithms on data. | 2 | F |
| 6.3. Employs link analysis algorithms on data. | 2 | F |
| 6.4. Interprets data mining application outputs. | 2 | F |
| Teaching Methods: | 10: Discussion Method, 14: Self Study Method, 16: Question - Answer Technique, 2: Project Based Learning Model, 6: Experiential Learning, 8: Flipped Classroom Learning, 9: Lecture Method |
| Assessment Methods: | A: Traditional Written Exam, F: Project Task |
Course Outline
| Order | Subjects | Preliminary Work |
|---|---|---|
| 1 | Introduction to Data Mining | |
| 2 | Data warehouse –OLAP – ETL process – Database vs. Datawarehouse | |
| 3 | Tools Used in Data Mining: Python, R, KNIME, pivoting.(KNIME Installation, Interface, Reading and viewing data) | |
| 4 | Data Preprocessing (Outliers, Missing data, Normalization, Data Transformation, Binning, Histogram) | |
| 5 | Association Rules (Shopping Cart Analysis) | |
| 6 | Supervised Learning, Classification And Decision Trees, Gini / Entropy Brief introduction. (Application: BEARS) | |
| 7 | Decision Trees (Application: MEDICINE and Wine / KNIME: filtering, train-test, validation, accuracy, Color Manager) | |
| 8 | Random Forest, Booststrapping, Ensemble Learning, Perfume Application + Telco Application | |
| 9 | Regression and Logistic regression | |
| 10 | Neural Network Model (principle, Hyperparameters). | |
| 11 | Unsupervised Learning / Clustering (Application: Wholesale Customer Data, Fuzzy C-means clustering, Quality measures) | |
| 12 | Data Reduction, Synthetic Data generation, PCA, clustering with Scatter Plot (DBSCAN) | |
| 13 | Big Data concepts – HADOOP, spark, MongoDB, NOSQL | |
| 14 | Project Discussion & Evaluation |
| Resources |
| 1. Data Mining Introductory and Advanced Topics, Margaret H. Dunham, Prentice Hall. Knime Application: https://docs.knime.com/ |
| 1. Data Mining Concepts and Techniques , J. Han & M. Kamber, Morgan Kaufman. 2. Big Data in Practice: How 45 Successful Companies Used Big Data Analytics to Deliver Extraordinary Results , Bernard Marr, Wiley, 2016 3. Weapons of Math Destruction: How Big Data Increases Inequality and Threatens Democracy, Cathy O'Neil ,2017 4. Naked Statistics: Stripping the Dread from the Data, Charles Wheelan, 2013 5. Kavram ve Algoritmalarıyla Temel Veri Madenciliği, Gökhan Silahtaroğlu, Papatya Yayıncılık. |
Course Contribution to Program Qualifications
| Course Contribution to Program Qualifications | |||||||
| No | Program Qualification | Contribution Level | |||||
| 1 | 2 | 3 | 4 | 5 | |||
| 1 | Defines the theoretical issues in the field of information and management. | X | |||||
| 2 | Describes the necessary mathematical and statistical methods in the field of information and management. | X | |||||
| 3 | Uses at least one computer program in the field of information and management. | X | |||||
| 4 | Sustains proficiency in a foreign language requiredor information and management studies. | ||||||
| 5 | Prepares informatics/software projects and work in a team. | ||||||
| 6 | Constantly updates himself / herself by following developments in science and technology with an understanding of the importance of lifelong learning through critically evaluating the knowledge and skills that s/he has got.7. Uses theoretical and practical expertise in the field of information and management | ||||||
| 7 | Follows up-to-date technology using a foreign language at least A1 level, holds verbal / written communication skills. | ||||||
| 8 | Follows up-to-date technology using a foreign language at least A1 level, holds verbal / written communication. | ||||||
| 9 | Adopts organizational / institutional and social ethical values. | ||||||
| 10 | Within the framework of community involvement adopts social responsibility principles and takes initiative when necessary. | ||||||
| 11 | Uses and analyses basic facts and data in various disciplines (economics, finance, sociology, law, business) in order to conduct interdisciplinary studies. | ||||||
| 12 | Writes software in different platforms such as desktop, mobile, web on its own and / or in a team. | X | |||||
Assessment Methods
| Contribution Level | Absolute Evaluation | |
| Rate of Midterm Exam to Success | 40 | |
| Rate of Final Exam to Success | 60 | |
| Total | 100 | |
| ECTS / Workload Table | ||||||
| Activities | Number of | Duration(Hour) | Total Workload(Hour) | |||
| Course Hours | 14 | 3 | 42 | |||
| Guided Problem Solving | 10 | 2 | 20 | |||
| Resolution of Homework Problems and Submission as a Report | 2 | 12 | 24 | |||
| Term Project | 7 | 2 | 14 | |||
| Presentation of Project / Seminar | 1 | 3 | 3 | |||
| Quiz | 1 | 4 | 4 | |||
| Midterm Exam | 1 | 15 | 15 | |||
| General Exam | 1 | 20 | 20 | |||
| Performance Task, Maintenance Plan | 0 | 0 | 0 | |||
| Total Workload(Hour) | 142 | |||||
| Dersin AKTS Kredisi = Toplam İş Yükü (Saat)/30*=(142/30) | 5 | |||||
| ECTS of the course: 30 hours of work is counted as 1 ECTS credit. | ||||||
Detail Informations of the Course
Course Description
| Course | Code | Semester | T+P (Hour) | Credit | ECTS |
|---|---|---|---|---|---|
| DATA MINING and BUSINESS INTELLIGENCE | MIS4112149 | Fall Semester | 3+0 | 3 | 5 |
| Course Program |
| Prerequisites Courses | |
| Recommended Elective Courses |
| Language of Course | English |
| Course Level | First Cycle (Bachelor's Degree) |
| Course Type | Required |
| Course Coordinator | Prof.Dr. Gökhan SİLAHTAROĞLU |
| Name of Lecturer(s) | Assist.Prof. Nesibe MANAV MUTLU |
| Assistant(s) | |
| Aim | To provide students with the ability to create a data warehouse from databases. To gain research skills on these data warehouses by using OLAP and data mining models. To get the level of knowledge to apply data mining algorithms. |
| Course Content | This course contains; Introduction to Data Mining ,Data warehouse –OLAP – ETL process – Database vs. Datawarehouse,Tools Used in Data Mining: Python, R, KNIME, pivoting.(KNIME Installation, Interface, Reading and viewing data),Data Preprocessing (Outliers, Missing data, Normalization, Data Transformation, Binning, Histogram),Association Rules (Shopping Cart Analysis),Supervised Learning, Classification And Decision Trees, Gini / Entropy Brief introduction. (Application: BEARS),Decision Trees (Application: MEDICINE and Wine / KNIME: filtering, train-test, validation, accuracy, Color Manager),Random Forest, Booststrapping, Ensemble Learning, Perfume Application + Telco Application,Regression and Logistic regression,Neural Network Model (principle, Hyperparameters).,Unsupervised Learning / Clustering (Application: Wholesale Customer Data, Fuzzy C-means clustering, Quality measures),Data Reduction, Synthetic Data generation, PCA, clustering with Scatter Plot (DBSCAN),Big Data concepts – HADOOP, spark, MongoDB, NOSQL,Project Discussion & Evaluation. |
| Course Learning Outcomes | Teaching Methods | Assessment Methods |
| 1. Will be able to produce data warehouse from database. | 10, 16, 9 | A |
| 1.1. Explains datamining. | 16, 9 | |
| 1.2. Defines Data Warehouse. | 16, 9 | |
| 1.3. Designs the Star Data Warehouse Model. | 10, 9 | |
| 1.4. Main tables association designs the data warehouse model. | 16, 9 | |
| 2. Will be able to relate Data Mining Models to each other. | 16, 9 | A, F |
| 2.1. Explains data mining models. | 16, 9 | |
| 2.2. Defines the concept of classification. | 16, 9 | |
| 2.3. Defines the concept of clustering. | 16, 9 | |
| 2.4. Defines the concept of connection analysis. | 16, 9 | |
| 3. will be able to apply the classification model. | 16, 9 | A, F |
| 3.1. Defines supervised learning. | 16, 9 | |
| 3.2. Defines the concept of class. | 9 | A |
| 3.3. sorts statistical algorithms. | 16, 9 | |
| 3.4. Applies decision trees. | 16, 9 | F |
| 3.5. Defines decision tree algorithms. | 16, 6, 9 | |
| 3.6. Defines pruning and purity values. | 9 | |
| 4. Will be able to employ the clustering model. | 14, 6, 9 | F |
| 4.1. Defines unsupervised learning. | 6, 9 | |
| 4.2. Explains the concept of clustering. | 16, 9 | |
| 4.3. Sorts clustering algorithms. | 6, 9 | F |
| 4.4. Applies K-means algorithm. | 14, 16, 9 | |
| 4.5. Defines genetic algorithms. | 10, 8, 9 | F |
| 5. Will be able employ the connection analysis model. | 14, 16, 6, 9 | A |
| 5.1. Interprets connection analysis rules. | 14, 6, 9 | |
| 5.2. Explains the concept of leverage. | 16, 9 | A |
| 5.3. Applies the relationship analysis method. | 14, 9 | |
| 5.4. Combines clustering with link analysis. | 2 | F |
| 6. Will be able to employ Data Mining Algorithms. | 2 | F |
| 6.1. Employs classification algorithms on data. | 2 | F |
| 6.2. Employs clustering algorithms on data. | 2 | F |
| 6.3. Employs link analysis algorithms on data. | 2 | F |
| 6.4. Interprets data mining application outputs. | 2 | F |
| Teaching Methods: | 10: Discussion Method, 14: Self Study Method, 16: Question - Answer Technique, 2: Project Based Learning Model, 6: Experiential Learning, 8: Flipped Classroom Learning, 9: Lecture Method |
| Assessment Methods: | A: Traditional Written Exam, F: Project Task |
Course Outline
| Order | Subjects | Preliminary Work |
|---|---|---|
| 1 | Introduction to Data Mining | |
| 2 | Data warehouse –OLAP – ETL process – Database vs. Datawarehouse | |
| 3 | Tools Used in Data Mining: Python, R, KNIME, pivoting.(KNIME Installation, Interface, Reading and viewing data) | |
| 4 | Data Preprocessing (Outliers, Missing data, Normalization, Data Transformation, Binning, Histogram) | |
| 5 | Association Rules (Shopping Cart Analysis) | |
| 6 | Supervised Learning, Classification And Decision Trees, Gini / Entropy Brief introduction. (Application: BEARS) | |
| 7 | Decision Trees (Application: MEDICINE and Wine / KNIME: filtering, train-test, validation, accuracy, Color Manager) | |
| 8 | Random Forest, Booststrapping, Ensemble Learning, Perfume Application + Telco Application | |
| 9 | Regression and Logistic regression | |
| 10 | Neural Network Model (principle, Hyperparameters). | |
| 11 | Unsupervised Learning / Clustering (Application: Wholesale Customer Data, Fuzzy C-means clustering, Quality measures) | |
| 12 | Data Reduction, Synthetic Data generation, PCA, clustering with Scatter Plot (DBSCAN) | |
| 13 | Big Data concepts – HADOOP, spark, MongoDB, NOSQL | |
| 14 | Project Discussion & Evaluation |
| Resources |
| 1. Data Mining Introductory and Advanced Topics, Margaret H. Dunham, Prentice Hall. Knime Application: https://docs.knime.com/ |
| 1. Data Mining Concepts and Techniques , J. Han & M. Kamber, Morgan Kaufman. 2. Big Data in Practice: How 45 Successful Companies Used Big Data Analytics to Deliver Extraordinary Results , Bernard Marr, Wiley, 2016 3. Weapons of Math Destruction: How Big Data Increases Inequality and Threatens Democracy, Cathy O'Neil ,2017 4. Naked Statistics: Stripping the Dread from the Data, Charles Wheelan, 2013 5. Kavram ve Algoritmalarıyla Temel Veri Madenciliği, Gökhan Silahtaroğlu, Papatya Yayıncılık. |
Course Contribution to Program Qualifications
| Course Contribution to Program Qualifications | |||||||
| No | Program Qualification | Contribution Level | |||||
| 1 | 2 | 3 | 4 | 5 | |||
| 1 | Defines the theoretical issues in the field of information and management. | X | |||||
| 2 | Describes the necessary mathematical and statistical methods in the field of information and management. | X | |||||
| 3 | Uses at least one computer program in the field of information and management. | X | |||||
| 4 | Sustains proficiency in a foreign language requiredor information and management studies. | ||||||
| 5 | Prepares informatics/software projects and work in a team. | ||||||
| 6 | Constantly updates himself / herself by following developments in science and technology with an understanding of the importance of lifelong learning through critically evaluating the knowledge and skills that s/he has got.7. Uses theoretical and practical expertise in the field of information and management | ||||||
| 7 | Follows up-to-date technology using a foreign language at least A1 level, holds verbal / written communication skills. | ||||||
| 8 | Follows up-to-date technology using a foreign language at least A1 level, holds verbal / written communication. | ||||||
| 9 | Adopts organizational / institutional and social ethical values. | ||||||
| 10 | Within the framework of community involvement adopts social responsibility principles and takes initiative when necessary. | ||||||
| 11 | Uses and analyses basic facts and data in various disciplines (economics, finance, sociology, law, business) in order to conduct interdisciplinary studies. | ||||||
| 12 | Writes software in different platforms such as desktop, mobile, web on its own and / or in a team. | X | |||||
Assessment Methods
| Contribution Level | Absolute Evaluation | |
| Rate of Midterm Exam to Success | 40 | |
| Rate of Final Exam to Success | 60 | |
| Total | 100 | |