Pandas' describe method is pretty naive.
It hardly highlights any key information about the data.
Instead, try Skimpy.
It is a Jupyter-based tool that provides a standardized and comprehensive data summary.
By invoking a single function, you can generate the above report in seconds.
This includes:
data shape
column data types
column summary statistics
distribution chart,
missing stats, etc.
What's more, the summary is grouped by datatypes for faster analysis.
Get started with Skimpy here: Skimpy.
👉 If you liked this post, don’t forget to leave a like ❤️. It helps more people discover this newsletter on Substack and tells me that you appreciate reading these daily insights. The button is located towards the bottom of this email.
Thanks for reading!
Whenever you’re ready, here are a couple of more ways I can help you:
Get the full experience of the Daily Dose of Data Science. Every week, receive two curiosity-driven deep dives that:
Make you fundamentally strong at data science and statistics.
Help you approach data science problems with intuition.
Teach you concepts that are highly overlooked or misinterpreted.
Promote your product to over 28,000 subscribers by sponsoring this newsletter.
👉 Tell the world what makes this newsletter special for you by leaving a review here :)
👉 If you love reading this newsletter, feel free to share it with friends!