Data Science Projects
Data Science Projects
Data Science Projects
With effective
Syllabus for Master of Computer Applications, 4th Semester from academic
Subject Name: Software Project -2 (Data Science) year 2018-19
Subject Code: 4649304
1. Learning Objectives:
To be able to develop Data Science Project using open source technologies
To learn Data Processing, Visualization and Analytical techniques on data set
3. General Guidelines for Data Science Project using Open source Technologies
5. Expected Outcome
1) The objective of the Data Science Project Development is to make students aware about
the industry based process and workings. As a result, Project must meet with the industry
standards.
2) There will not be any compulsion to prepare a project report for the students but an
application and supportive documents should be self-explanatory, so that evaluator may
get the detail about the Project developed and can evaluate the students as per the
evaluation criteria.
Group size: 2-3 Persons.
3) Power Point Presentation Content (30 Slides Max.):
Page no. 1 of 4
GUJARAT TECHNOLOGICAL UNIVERSITY
With effective
Syllabus for Master of Computer Applications, 4th Semester from academic
Subject Name: Software Project -2 (Data Science) year 2018-19
Subject Code: 4649304
6. Suggested
PS: Below list (a & b) are suggestive one. You may select any other relevant topics/Data
Sets.
a) Project Definitions
1) A Study on Employee Attrition Prediction and Analysis
2) A Study on Student Dropout Prediction and Analysis
3) A Study on Student Result Prediction and Analysis
4) A Study on Heights and Weights Data
5) A Study on Loan Prediction and Analysis
6) A Study on Housing Data
7) A Study on Weather Data
8) A Study on Movie Lens ( https://movielens.org)
9) A Study on Trip Data
10) A Study on Census and Income Data
11) A Study on Songs Data
12) A Study on Sales Data
13) A Study on Online Shopping Data
14) A Study on Cyber Crime Data
15) A Study on Airline Safety
16) A Study on Spam emails /Get rid of Spam emails ()
17) A Study on Pictures / Working with Pictures
18) Working with Handwritten Information
19) Analyzing Reviews ( e.g. amazon.com)
Page no. 2 of 4
GUJARAT TECHNOLOGICAL UNIVERSITY
With effective
Syllabus for Master of Computer Applications, 4th Semester from academic
Subject Name: Software Project -2 (Data Science) year 2018-19
Subject Code: 4649304
7. Evaluation
Sr. No Particulars Weightage
1 Topic & Selection of Algorithm 10%
2 Data Pre-processing ( Cleaning, Reducing 20%
Dimensionality )
3 Data Visualization 20%
4 Data Analysis / Algorithm 20%
5 Result 30%
Page no. 3 of 4
GUJARAT TECHNOLOGICAL UNIVERSITY
With effective
Syllabus for Master of Computer Applications, 4th Semester from academic
Subject Name: Software Project -2 (Data Science) year 2018-19
Subject Code: 4649304
Recommended Book(s):
1) Field Cady, 'The Data Science Handbook ', Wiley Publication ISBN-13: 978-1119092940
2) Jake VanderPlas, ‘Python Data Science Handbook ESSENTIAL TOOLS FOR
WORKING WITH DATA’, O’REILLY ISBN:978-1-491-91205-8
3) Rachel Schutt and Cathy O’Neil, Doing Data Science, O'REILLY
4) Wes McKinney,Python for Data Analysis Data Wrangling with Pandas, NumPy, and
IPython, 2nd Edition , O'REILLY
5) Anand Rajaraman and Jeffrey David Ullman, “Mining of Massive Datasets”, Cambridge
University Press, 2012
6) John W. Foreman (Author), Data Smart: Using Data Science to Transform Information
into Insight, WILEY
7) John Paul Mueller, Luca Massaron, Python for Data Science For Dummies , WILEY
Page no. 4 of 4