CDC

Data is impacting many areas of science, engineering, and industry; from analyzing troves of weather data to modeling traffic patterns to processing millions of online customers, it is the enormous data which is creating new opportunities and challenges. To tackle these challenges, one must have the training to store, manage, process and analyze data at these scales. But the challenges are beyond scale alone, the complexity of the data requires new powerful analytical techniques. Finally, it is crucial to have skills in communicating and interpreting the results of this analysis. A person trained in all of these skills is a data scientist.

Graduate Certificate in Data Science Program

Applying

Admissions to the Graduate Certificate in Data Science here.

Frequently Asked Questions

Applying and Enrolling in the Program

A: You should follow these application instructions. It involves applying as a non-matriculated student, and a short form with some background information, a CV, and a personal statement, and a question about your background in programming/computing.

A: Then just go to the second step here: application instructions. It is a short form with some background information, a CV, and a personal statement, and a question about your background in programming/computing.

A: We process the applications on a rolling basis. You may hear back in a few days, but it may be a few weeks.

A: A student who does well in the certificate classes will be a very strong candidate for the MS program. But they will need to apply here. The most similar degree is the MS in Computing under the Data Management and Analysis track.

A: Up to 9 hours (3 classes) can transferred from non-matriculated to a graduate degree program.

A: No. It is not a degree program. For this, one would need to be enrolled in the MS or PhD program.

Programming and Computing

A: The Graduate Certificate in Data Science consists of classes which are also taken and required for MS and PhD students in the School of Computing, so they all have programming and computing components. However, they are typically not among the heaviest programming classes (except the optional CS6530 Advanced Database Systems). That said, we will expect for admitted students that short abstract programming tasks will not be a burden to your learning experience, but rather a way to enhance it.

Many successful applicants/students aiming for the certificate are software engineers, while others work in a quantitative area where they do some minor programming. Having taken some computer science classes at a university a while back is fine (it should come back to you). But we expect you can do more than just scripting; use of data structures is essential to manipulating big data.

A: While other certificate or degree programs may focus on software tools (e.g. Tableau) that try to remove the need for programming from the job of a data analyst, we feel this approach is limited. All of these tools work in limited scenarios, with limited data sizes and formats. Our program focuses on the fundamentals of data science, including how these sorts of tools work. But more generally, as the tools change we want to train our students to understand what is going on behind this change, how to analyze and handle a problem when the data is too big for a tool, and how to investigate and evaluate new developments before they are integrated into easy to use mainstream programs. For instance, a tool or package may present many options to learn a classifier (e.g. given nicely formatted and labeled customer data, predict which of the future potential customers will contribute to a net profit), our program aims to teach students which option to use, how much to trust that prediction, how to apply this to very large datasets, and potentially how to go beyond the presented options.

If an exciting new technique is described in a brand new research paper about data science, we hope our students will be trained to find, understand, and put this technique to work.

A: Most of the classes are fairly agnostic to the programming language, and can be completed in many languages ranging from R to Python to C++. High level languages like python, or to some extent even including R, are becoming built out to deal with larger scale. More importantly, they now have many useful libraries that really eases the implementation of many core tasks in data management and analysis. On the other hand, knowing and using a more low-level language like C, C++, or Java, will perhaps allow students to get more out of the classes and assignments. Students have witnessed 3 to 4 orders of magnitude speed up on some assignments using Java instead of R.

Certain classes may require C, SQL, or Processing (based on Java), but (with the exception of CS6530 Database Systems) do not expect much if any prior experience in those languages by the students.

Costs

A: It is a bit complicated. It depends on how spread out you take the classes and whether you are a Utah resident or not. For residents at a 1-class/semester pace, the certificate should cost roughly $10,000.

Here are the rate schedules for residents (lived in Utah last 3+ years) and non residents.
So if one takes the classes more rapidly, then it will cost less.
We are investigating normalizing this cost.

Courses and Availability

A: The plan is to offer the following classes every Fall and Spring.
Fall:

  • cs6630 Visualization for Data Science
  • cs6530 Database Systems
  • cs6350 Machine Learning

Spring:

  • cs6140 Data Mining
  • cs5530 Database Systems (can be substituted for cs6530)

A: The times and days will be on a semester by semester bases. The most up to date tentative plans for scheduling can be found at the University of Utah Class Catalog. Once you have chosen the appropriate semester, all classes are listed under the CS : Computer Science link.

We plan to mostly video tape and live stream, depending on demand.

A: The document found here serves as the official requirements. It may be updated on an annual basis. You may complete the certificate under any such official document posted during the course of your enrollment in the program.

A: 1 or maybe 2. It will depend on your background. Some classes have a bit less load than others, so pairing two classes with a lesser load, especially if you have some background in the area may be doable. But most students also working full time take 1 class at a time.

That said, some very dedicated students do manage to successfully take 3 or more classes at a time while also working. 3 classes is a full load for full-time MS or PhD students.

A: No, not in general. However, given an individual’s background and time off from school, we may guide you towards classes that may be a bit easier first so that it is easier to get back into the swing of things.