Working with Large Data Sets through Emory Libraries

Whether you’re working on a class project, conducting a market analysis, or exploring new research ideas, Emory Libraries has a variety of resources available for accessing datasets.

Dewey Data

This research platform provides access to third-party quantitative and geospatial data focused on finance, marketing, consumer demographics, behavioral economics, urban planning, and real estate. Their datasets let you explore spend data, time series data, mobile phone ping data, and more.

Dewey Data provides many examples of academic research made possible through their data partners.

Register for an account with your Emory email. Then, use this Dewey Data Guide, which offers a variety of tutorials and documentation, to learn how to use Dewey Data and work with its datasets.

TDM Studio

If you’re focused on text and data mining, use TDM Studio to create and analyze datasets from a selection of ProQuest newspapers, including Barron’s, Boston Globe, Chicago Tribune, Los Angeles Times, New York Times, Washington Post, Wall Street Journal, and USA Today. You can evaluate the data using R or Python in TDM Studio’s Jupyter Notebook, and also use their internal visualization options for geographic analysis, topic modeling, and sentiment analysis.

Data Resources from Emory Center for Digital Scholarship (ECDS)

Created by ECDS, this guide is a curated collection of data on topics including Census, Economics, Health, Public Opinion, and Social Indicators. It also includes information about applying statistical and mapping software, such as Python, R, SAS, SPSS, Stata, and ArcGIS, to analyze this data.

GBL Datasets and Text Analysis Guide 

This guide points to these and many other data-rich sources to explore, as well as recommended training and tutorials.

If you have questions about registering or getting started with these resources, just Ask a Librarian.