Generate Your Own Fake Data In Seconds
To run or test a pipeline, we usually need to feed it some dummy data.
Python's "random" library can generate random strings, floats, and integers. But because its output is random, it does not produce meaningful data such as people's names, city names, emails, etc.
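To make the limitation concrete, here is a minimal sketch that builds a dummy record with the standard library alone. The helper name random_record and the field names are assumptions chosen for illustration only.

```python
import random
import string


def random_record(rng):
    """Build one dummy record from the standard library only (hypothetical helper)."""
    return {
        # Eight random lowercase letters: valid as a string, but not a real name.
        "name": "".join(rng.choices(string.ascii_lowercase, k=8)),
        # Random integer in the inclusive range 18..90.
        "age": rng.randint(18, 90),
        # Random float between 0 and 1.
        "score": rng.uniform(0, 1),
    }


# A seeded generator keeps the dummy data reproducible between runs.
rng = random.Random(42)
record = random_record(rng)
print(record)
```

The values have the right types, but the "name" field is just a jumble of letters, which is exactly the gap described above.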
Searching for open-source datasets instead can be time-consuming. Moreover, the dataset you find may not fit your requirements well.
The Faker module in Python is a perfect solution to this. Faker allows you to generate highly customized fake (yet meaningful) data quickly. What's more, you can also generate data specific to a demographic.
Read more in the Faker documentation.


