An introduction to programming in Python language, using Jupyter Notebooks and Python scripts. Covers variables, conditionals, loops, functions, lists, strings, tuples, sets, dictionaries, files and visualization.
Restriction: Must be in one of the following programs: (Data Science Post-Baccalaureate Certificate, Master of Professional Studies in Data Science and Analytics, or Master of Professional Studies in Machine Learning). Cross-listed with: BIOI602, MSML602.Credit Only Granted for BIOI602, DATA602, MSML602, or CMSC641.
Formerly: CMSC641.
The Independent Research Project requires that students write a preliminary proposal explaining what they expect to accomplish during their research project. A student may engage in research by either serving as an assistant to an ongoing project that has been initiated by a faculty member, or he or she may initiate his or her own research project with a faculty sponsor.
Restriction: Students are expected to have completed CMSC330 / CMSC351 prior to taking CMSC499A.
An introduction to the data science pipeline, i.e., the end-to-end process of going from unstructured, messy data to knowledge and actionable insights. Provides a broad overview of what data science means and systems and tools commonly used for data science, and illustrates the principles of data science through several case studies.
Prerequisite: STAT100, MATH135, or any 400-level STAT course.
The course is dedicated to the study of ethical issues associated with data science, including data collection, gathering existing data, ethical use of data, data analysis with teams, repeatability and reproducibility of data analysis, and academic and scientific integrity.
An introduction to the data science pipeline, i.e., the end-to-end process of going from unstructured, messy data to knowledge and actionable insights. Provides a broad overview of what data science means and systems and tools commonly used for data science, and illustrates the principles of data science through several case studies.