Protect your data and ML products from low-quality data
The open-source framework for precision data testing for data scientists and ML engineers.
Build confidence in the quality of your data by defining schemas for complex data objects
Pandera provides a simple, flexible, and extensible data-testing framework for validating not only your data but also the functions that produce it.
A simple, zero-configuration data testing framework for data scientists and ML engineers seeking correctness
Write Complex Schemas with Ease
Leverage Pandera’s zero-configuration API for defining schemas using modern Pythonic idioms.
Validate Critical Points of Your Pipeline
Identify the critical points in your data pipeline, and validate data going in and out of them.
Quickly Bootstrap Schemas with Trusted Data
Overcome the initial hurdle of defining a schema by inferring one from clean data, then refine it over time.
Easily Create Custom Validation Checks
Access a comprehensive suite of built-in tests, or easily create your own validation rules for your specific use cases.
Synthesize Fake Data to Validate Pipelines
Validate the functions that produce your data by automatically generating test cases for them.
Integrate seamlessly with the Python ecosystem
Join Our Community
Join the community to help simplify data testing!
Become a Contributor
Open issues, feature requests, and PRs.
Get Support on Discord
Join us on our team chat, ask questions, and help others!
Help Improve Our Documentation
See a typo or room for improvement? Help us!