Now Publishing Complete Lahman Baseball Database with Datasette

Summary: The Datasette API available at now contains the full Lahman Baseball Database.

In a previous post, I described how I'm using Datasette to publish a subset of the Lahman Baseball Database. At that time, I only published three of the 27 tables available in the database. I've since expanded that Datasette API to include the complete Baseball Database.

The process for this was quite straightforward. I ran the MySQL dump Lahman helpfully provides through this mysql2sqlite tool to provide an import file for sqlite. Importing into sqlite for publishing with Datasette was as simple as:

$ ./mysql2sqlite lahman2016.sql | sqlite3 baseball.db

The complete sqlite version of the Lahman database is 31 megabytes.


With the full database now loaded, there are many more interesting queries that can be run. Including:

  • Who played the most games at 2B in MLB history? Answer
  • In what park has the most MLB games been played? Answer
  • Which players who made their debut in the 1950 season or later committed they most errors? Answer

Happy querying. If you are using this in a project or interested in learning more, leave a comment below or contact me directly.

Comments !