
The exponential growth of the global datasphere, which is anticipated to reach a staggering 175 zettabytes by 2025, has instigated a revolution in big data technologies. This unprecedented expansion necessitates significant advancements in data collection and processing infrastructure to handle such vast amounts efficiently. Innovations like IoT networks, distributed computing frameworks, and real-time machine learning pipelines have emerged as essential