Publish the Constitution Annotated As Data
Dear Library of Congress and Government Printing Office,
For decades, you have jointly published a handy compendium that explains the U.S. Constitution as it has been interpreted by the U.S. Supreme Court. It took a couple of letters from the Senate (and repeated nudging from the public interest community—2009, 2010, 2011, 2012, 2013) to move you to publish the Constitution Annotated online more than once a decade, but you still do not regularly publish it online in a structured-data format. Instead, the Constitution Annotated is published as a PDF, which has not been updated in 15 months.
The entire point of the document is to educate the public and Congress about the Constitution. As a technical matter, the Constitution Annotated is prepared as an XML file, published internally to congressional staff as a series of web pages, and updated regularly. You could simply make those pages available to the public and we would all be happy. Instead, the public interest community must keep pestering you, year after year.
Why do we care? Publishing the Constitution Annotated in a structured-data format means that the public can easily reuse the information so that more people can benefit from the knowledge it contains. Structured data makes it easier to embed the information in Wikipedia, or create better websites on the Constitution, and so on. It also means we can do neat things with the contents, such as automatically classifying Supreme Court cases by topic simply by drawing upon the document’s structure.
Publishing the Constitution Annotated in structured-data format is also within your mission. As the respective repository and publisher of government-generated information, providing public access to an authoritative explanation of our nation's founding document, as interpreted over the years, is the kind of thing you do.
So I ask you, on Constitution Day 2014, let's get this fixed before next year. We're happy to help.