About dadosbrasil

Source, cadence, methodology and limitations.

dadosbrasil publishes the open data of Brazil's National Registry of Legal Entities (CNPJ) in three formats: HTML (this site), Markdown, and a JSON API. We don't collect anything ourselves: every row comes directly from the public files on the Brazilian federal open-data portal.

Sources

Refresh cadence

The Federal Revenue publishes a new snapshot of the CNPJ archive roughly once a month. dadosbrasil is fully re-imported on every release — we don't do incremental diffs, since the official consistency guarantee is per snapshot. So the effective "as-of" date of our data is the date of the most recent RFB snapshot, not today; companies changed very recently may take up to ~6 weeks to show up.

Methodology

Pages are generated straight from the RFB files with no manual enrichment. The only additional computations are:

Privacy and CPF masking

The Federal Revenue itself publishes individual partners' CPF numbers in the masked form ***XXXXXX** — the first three and last two digits are always hidden. dadosbrasil just reproduces this form. We never try to reconstruct the full CPF, nor to cross-reference masked CPFs with other databases to de-anonymize anyone.

If you show up as a partner and want your page (/en/cpf/***XXXXXX**) removed, the first step is to speak with the Federal Revenue — the source. Since our data mirrors the public snapshot, any RFB correction propagates automatically on the next monthly cycle. For urgent cases, reach out via GitHub Issues.

Known limitations

License and attribution

The original data is public and free to use under the Federal Revenue's terms. Derivative works should cite Receita Federal as the source and, optionally, dadosbrasil as the intermediary/reformatter.

Source code

dadosbrasil is open source at github.com/brazildata/api.

Em português