Skip to main content

About the Educational Dataset Service (EDS)

What is the EDS?

The UC San Diego Educational Dataset Service (EDS) houses curated, machine-learning-ready datasets for use in instruction and research. These datasets may include any and all of real research datasets, sanitized and synthetic administrative data from campus sources, and datasets commonly used for training in specific disciplines. The service will eventually also offer guidance & training for students on working with associated datasets, such as how to re-use data, how to cite sources, and how to work with metadata. Finally, it will include support for instructors in developing & assessing meaningful assignments.

Who is the EDS for?

The EDS is available for general use by anyone. Initial implementation of the EDS, however, is guided by the need we have heard expressed by instructors, researchers and students for reputable, verified and accessible datasets that are appropriate for coursework, especially for training machine learning and artificial intelligence models. The EDS will not only host appropriate datasets, but will also eventually include training and usage resources around the data.

Opportunities for Student Engagement

The EDS will not only be FOR students, it will be developed WITH students. There will be opportunities (including paid positions) for students to engage in the creation of the system, including coding and tech support, working with data and metadata creation, and liaising with campus faculty and researchers.

Coming Soon

Questions, comments or feedback? Contact us at: datanexus@ucsd.edu.