Abstract
This paper presents an optimized implementation of the Apriori algorithm tailored for large-scale data mining in cloud-native, serverless environments, utilizing real-world fuel datasets. Our approach achieves a 28% reduction in execution time and a 22% decrease in memory consumption compared to traditional distributed Apriori methods. The study leverages high-dimensional fuel datasets, spanning from 2020 to 2050, to evaluate scalability and efficiency in processing energy-related data. By employing advanced synchronization and deferred partitioning strategies, communication overhead is significantly reduced, improving performance while effectively balancing computational loads across distributed nodes. Security measures, including AES-256 encryption and role-based access control (RBAC), are incorporated to safeguard data confidentiality and ensure compliance with regulatory frameworks. The proposed solution scales efficiently for datasets up to 1 million records, demonstrating applicability across domains such as transportation and logistics. Future work will explore adaptive partitioning techniques, hybrid cloud architectures, and AI-driven predictive analytics to further enhance scalability and operational efficiency in serverless multi-cloud systems.
Original language | English |
---|---|
Pages (from-to) | 452-459 |
Number of pages | 8 |
Journal | Procedia Computer Science |
Volume | 257 |
DOIs | |
Publication status | Published - 2025 |
Externally published | Yes |
Event | 16th International Conference on Ambient Systems, Networks and Technologies Networks, ANT 2025 / 8th International Conference on Emerging Data and Industry 4.0, EDI40 2025 - Patras, Greece Duration: 22 Apr 2025 → 24 Apr 2025 |
Keywords
- Apriori Algorithm
- Big Data
- Cloud-Native Architectures
- Data Mining
- Distributed Computing
- Maritime
- Serverless Computing
ASJC Scopus subject areas
- General Computer Science