California secretary of state refuses to open arcane data systems

We asked Alex Padilla’s office to show us how to use their database tracking money in politics. They said no.

Alex Padilla
Alex Padilla

The office of California Secretary of State Alex Padilla has refused our request for the computer code that accurately extracts data from the state’s campaign finance and lobbying activity database.

Last year, we began an effort to create open-source software that makes it easier to access and understand a raw copy of the database, known as CAL-ACCESS, published online by Padilla’s predecessor.

During careful study of the sprawling dump, which includes more than 36 million records spread across 76 interlocking database tables, we discovered that sophisticated filters, links and transformations are necessary to follow the millions of dollars that oil the statehouse in Sacramento.

We also found the database documentation released by the state insufficient, offering fragmentary and incomplete explanations of how to work with the data.

Many database tables have been defined only by cryptic references to characters named “Matt,” “D H,” “J M” and “Cox.”

An example of what to expect
An example of what to expect

Further, some important elements of the database have been excluded entirely, like the crucial connections that link campaign fundraising committees to elected officeholders and high-stakes ballot initiatives.

The result: A high barrier blocks anyone seeking to interpret the data, requiring weeks of study and significant guesswork to overcome, while raising the menacing risk that uninitiated analysts will make a grievous error.

In an effort to solve the problem, last month we utilized the California Public Records Act to request that state officials release the computer code that supplies data to the state’s searchable user-interface at cal-access.ss.ca.gov.

While the tools provided by the state site are limited, the codebase that powers it should include a negotiation of the underlying CAL-ACCESS database that returns accurate data.

On March 2, that request was denied by an anonymous email account, [email protected].

“This source code is computer software that is not a public record pursuant to Government Code Section 6254.9 and, therefore, is not subject to disclosure requirements of the California Public Records Act,” the response said.

That same day we responded with disappointment and requested that Padilla’s office reconsider its decision. We have yet to receive a reply.