Anonymized Invoices
If you are building invoice recognition models, a dataset of real invoices is invaluable. This dataset is a collection of anonymized invoices: 60% were issued by a single company, and the remaining 40% by a variety of businesses.
Dataset specs
Type: Text
File format:
Region/Locale: EN
Amount: 50K
Leverage
This dataset supports the development of AI systems that extract, understand and analyse financial information from real-world invoices at scale.
Use cases
Improve document understanding pipelines by teaching models to read structured PDF invoices through optical character recognition.
Enable systems that classify expenses, match line items to accounting codes and prepare data for Enterprise Resource Planning (ERP) or bookkeeping tools.



Do you need a specific dataset?
We understand the uniqueness of every project. That's why we offer customizable dataset solutions to match your specific requirements.
