Databases and SQL
In the late 1920s and early 1930s, William Dyer, Frank Pabodie, and Valentina Roerich led expeditions to the Pole of Inaccessibility in the South Pacific and then onward to Antarctica. Two years ago, records from their expeditions were found in a storage locker at Miskatonic University.
We have scanned and OCR’d their data and now want to store that information in a way that will make search and analysis easy.
Three common options for storage are text files, spreadsheets, and databases. Text files are easiest to create, and work well with version control, but we would need to build our own tools to manage and analyze the data. Spreadsheets are good for doing simple analyses but they don’t handle large or complex data sets well. Databases, however, include powerful tools for search and analysis and can handle large, complex data sets with ease. These lessons will show how to use a database to explore the expeditions’ data.
Prerequisites
If SQLite is being used from the shell, learners will need to be able to navigate directories and run simple commands from the command line. If a GUI such as the Firefox SQLite plugin is being used, learners will need to know how to install browser plugins (and have permission to do so).
Check Discussion for database setup instructions
Topics
- Selecting Data
- Sorting and Removing Duplicates
- Filtering
- Calculating New Values
- Missing Data
- Aggregation
- Combining Data
- Creating and Modifying Data
- Programming with Databases