Eastern Illinois University Logo
Math |

Mathematics and Computer Science

Math Home

Graduate Studies

Office Hours

HS vs College

Research Opportunities for Undergrad & Graduate Students

About Me

Contact Me

MAT 5335 -- Intro to Data -- Spring 2017

This syllabus is tentative and should not be considered definitive. The instructor reserves the right to modify it to meet the needs of the class. It is the student responsibility to attend class regularly and to make note of any changes. Be sure you check the class website for sections covered on the exams, homework problems and announcements.

Catalog Description

Credit Hours: 4; An introduction to data analysis.

Course Objectives

To learn modern computational tools for representing, extracting, manipulating, interpreting, transforming, and visualizing data.

Prerequisites

Graduate standing or permission of instructor.

Textbooks and Such

We will attempt to cover some or all of the following technologies this semester: R, Visualizing Data, UNIX, bash, Regular Expressions, SQL, XML. We will attempt to cover the schedule and material found on Dr. Ward's data analysis page.

Ebooks on the Safari platform are available campus-wide and remotely with an EIU login, with up to 3 simultaneous users. Springer ebook collection has no simultaneous user restriction, and can usually be downloaded as a pdf. Click below to see list of books, websites, and other stuff. This list will be continuously updated.

  • R Textbooks
    • The R Project website; click on documentation.
    • Data Manipulation with R by Phil Spector; SpringerLink ebook
    • Software for Data Analysis: Programming with R by John Chambers; SpringerLink ebook
    • R by Example by Jim Albert and Maria Rizzo; SpringerLink ebook
    • Introductory Statistics with R by Peter Dalgaard; SpringerLink ebook
  • SQL and Command Line Textbooks and links
    • MySQL website; click on manuals for an introduction to R.
    • SQL Cookbook by Anthony Molinaro; Safari ebook, hardcopy via interlibrary loan
    • Learning SQL, 2nd Edition, Alan Beaulieu; Safari ebook, hardcopy via interlibrary loan
    • MySQL in a Nutshell by Russell Dyer; hardcopy
    • The Linux Command Line by William E. Shotts, Jr.
  • Websites for Data Sets
  • Websites and Books for Visualizing Data
  • Websites for learning the latest software tools
    • Lynda.com All EIU students, faculty, and staff have free unlimited access.
    • DataCamp Data Science Education via R and Python.
    • edX uses DataCamp as part of their exercises; also contains a news and career blog.
    • OpenIntro Free open source introductory textbook in R and Statistics.
  • Podcasts, Blogs, and More Websites
Assignments and Tentative Schedule

Projects will be weekly and scores will be posted in D2L . Before attempting problems, you are expected to read the material carefully. You are strongly encouraged to discuss assignments with your instructor and to work with your fellow classmates. However, you are expected to write up your own solutions. The semester will conclude with a final project presentation to take place during scheduled final exam day and time.

It is assumed that each student has their own laptop. There are also two computer labs located in Old Main 3041 and 3013. The computers in the library may not have all relevant software we need but if you click on this link then 'VMware Horizon HTML Access', you will be able to access the virtual desktop.


Daily activities and assignments
Monday Wednesday Friday
9 Jan
How to connect to the virtual desktop and a quick introduction to RStudio. Click here for slides from a recent talk.
11 Jan
Click here for Introduction to RStudio .
13 Jan
Class canceled; instead click here for Project 1 due Fri 20 Jan. (last updated Jan 15)
16 Jan
No Classes
18 Jan
Reading material: An Introduction to R.
Click here for intro to plots, tapply, and data frames.
20 Jan
Click here for Project 2 due Fri 27 Jan. (last updated Jan 22)
23 Jan
Reading material: R Graphics
Click here for intro to data visualization and graphics.
25 Jan
27 Jan
Click here for Project 3 (last updated Jan 29)
30 Jan
Click here for continued study of data visualization; intro to data structures.
1 Feb
Windows users, please visit Cygwin and PuTTY/VNC .
3 Feb
Click here for Project 4 (last updated Feb 3)
6 Feb
Click here an introduction to R Markdown and ggplot.
8 Feb
10 Feb
Click here for Project 5 (last updated Feb 19)
13 Feb
Click here for an introduction to using the terminal window.
15 Feb
Click here for an introduction to SQLite.
17 Feb
No Classes
20 Feb
Click here for SQLite in R (in mathvnc).
Click here for MySQL in R (in virtual machine).
22 Feb
Click here for regular expressions in SQL.
24 Feb
guest speaker Dr. Anabela Maia from Biology
27 Feb
Click here for an introduction to statistics in R.
1 Mar
3 Mar
Click here for Project 6 due today (last updated Mar 5)
6 Mar
Click here for an introduction to ANOVA.
8 Mar
EIU Career Services; current EIU graduate student; UIUC resume writing and samples ; industry and government careers; latex resume templates in Overleaf.
10 Mar
Final Project Brainstorm!
Project 7 due today (last updated Mar 3)
20 Mar
regular expressions 101
22 Mar
Job Search: indeed.com, linkedin.com, usajobs.gov.
24 Mar
Project 8 due: Cover letter for a job, Resume, and read Ch 3 of WMD
guest speaker Dr. Barry Kronenfeld.
27 Mar
Analyzing Chicago Crime Statistics
29 Mar
Monte Carlo Simulations
31 Mar
Project 9 due today (last updated Mar 28)
3 Apr
Ch 4 WMD
Click here for an introduction to ggmap.
5 Apr
maps
7 Apr
Project 10 (last updated: 5 Apr) due Wed 19 Apr.
10 Apr
Ch 5 WMD
Click here for an introduction to htmltab.
12 Apr
14 Apr
guest speaker Mr. Ahmed Salim Nuhu
17 Apr
Ch 6
19 Apr
Ch 7
guest speaker Dr. Andrew Mertz
21 Apr
Ch 8
5 minute project presentation
24 Apr
Ch 9
JSON to CSV Converter
Data.json Merge Tool
26 Apr
Ch 10
reddit datasets
reddit data is beautiful
reddit beautiful maps
28 Apr
Conclusion
5 minute project presentation

Course Grade

The course grade will be composed of the grades on the projects and be determined using the following distributions. A: 90-100%, B: 80-89%, C: 70-79%, D: 60-69%, and F:0-59%.

Attendance

Students are expected to attend class meetings as scheduled. When an absence does occur, the student is responsible for the material covered during the absence. When possible, the student should notify the instructor in advance of an anticipated absence.

Course and Instructor Evaluations

During the last few weeks of the semester, you will be provided an opportunity to evaluate this course and your instructor. Your feedback is vital to improving education at Eastern Illinois University. I strongly urge you to participate in the evaluation system.

Accommodations for Students with Disabilities

Click here to view information regarding accommodations.

Academic Dishonesty

Eastern Illinois University faculty and students commit themselves towards maintaining a culture of academic integrity and honesty. Click here to view the Student Code of Conduct. Violations will be reported to the Office of Student Standards, which can result in a failing grade for the course.

Student Success Center

Students who are having difficulty achieving their academic goals are encouraged to contact the Student Success Center. The center provides individualized consultations for assistance with time management, test taking, note taking, avoiding procrastination, setting goals, and other skills to support academic achievement.

Take Care of Yourself

Diminished mental health, including significant stress, mood changes, excessive worry, or problems with eating and/or sleeping can interfere with optimal academic performance. The source of symptoms might be related to your course work; if so, please speak with me. However, problems with relationships, family worries, loss, or a personal struggle or crisis can also contribute to decreased academic performance. Know that help, if needed, is available.

EIU provides cost-free mental health services through the Counseling Center to help you manage personal challenges that threaten your personal or academic well-being. Visit them on the first floor of the Human Services Building, Monday through Friday from 8:00 am-4:30 pm. You may also contact them at 217-581-3413. For after-hours emergencies, contact their emergency number at 217-549-6483.

If you are uncomfortable seeking on-campus services, LifeLinks in Mattoon offers mental health services. Although they charge fees for some services, they have a no-cost toll free Crisis Line (866-567-2400); when a caller phones the toll free number, they are immediately connected to a trained crisis clinician. There is no cost for this service, and it is available 365 days per year, 24 hours per day.

Remember, getting help is a smart and healthy thing to do - for yourself and for your loved ones.