Now Publishing Complete Lahman Baseball Database with Datasette
In a previous post, I described how I'm using Datasette to publish a subset of the Lahman Baseball Database. At that time, I only published three of the 27 tables available in the database. I've since expanded that Datasette API to include the complete Baseball Database.
The process for this was quite straightforward. I ran the MySQL dump Lahman helpfully provides through this mysql2sqlite tool to provide an import file for sqlite. Importing into sqlite for publishing with Datasette was as simple as:
$ ./mysql2sqlite lahman2016.sql | sqlite3 baseball.db
The complete sqlite version of the Lahman database is 31 megabytes.
With the full database now loaded, there are many more interesting queries that can be run. Including:
- Who played the most games at 2B in MLB history? Answer
- In what park has the most MLB games been played? Answer
- Which players who made their debut in the 1950 season or later committed they most errors? Answer
Happy querying. If you are using this in a project or interested in learning more, leave a comment below or contact me directly.