Intelligent Data Partitioning for Distributed Cloud Analytics

Authors

  • Feroskhan Hasenkhan Truveta, USA Author
  • Arun Ayilliath Keezhadath Amazon Web Services, USA Author
  • Lalitha Amarapalli Fresenius-Kabi, USA Author

Keywords:

intelligent data partitioning, distributed cloud analytics, horizontal partitioning, scalability

Abstract

The purpose of this paper is to explore the importance of intelligent data partitioning techniques which enhances the efficiency of distributed cloud analytics and especially focusing on big data. Cloud computing continuously handles huge volume of data for that reason efficient data partitioning becomes very critical in minimising query response time and optimising computational resources.

Downloads

Download data is not yet available.

References

S. Ghosh, N. K. Gupta, and S. P. Raja, "Data partitioning strategies for distributed

cloud systems," Journal of Cloud Computing: Advances, Systems, and Applications,

vol. 9, no. 3, pp. 59-71, 2022.

Singu, Santosh Kumar. "Real-Time Data Integration: Tools, Techniques and Best

Practices." ESP Journal of Engineering & Technology Advancements 1.1 (2021):

-172.

Singu, Santosh Kumar. "ETL Process Automation: Tools and Techniques." ESP

Journal of Engineering & Technology Advancements 2.1 (2022): 74-85.

L. Zhang, W. Li, and Y. Tang, "A comprehensive study on data partitioning techniques

for big data analytics in cloud environments," International Journal of Cloud

Computing and Services Science, vol. 6, no. 1, pp. 11-21, 2021.

T. Wu, H. Zhang, and Y. Hu, "Cost-based query optimization for distributed cloud

systems," IEEE Transactions on Cloud Computing, vol. 8, no. 4, pp. 1025-1038,

R. J. Wang, H. Y. Xu, and W. Chen, "Intelligent cloud data partitioning using machine

learning," IEEE Transactions on Big Data, vol. 7, no. 3, pp. 228-239, 2020.

A. K. Agarwal and S. Joshi, "Dynamic partition pruning for cloud analytics," ACM

Transactions on Database Systems, vol. 44, no. 1, pp. 34-47, 2021.

J. G. Lee, W. Kim, and P. S. Lin, "Cost-based optimization for large-scale data

partitioning in cloud environments," Journal of Cloud Computing, vol. 10, no. 2, pp.

-95, 2019.

S. Patel, J. Roy, and B. Guha, "Improving distributed query performance using

intelligent data partitioning techniques," IEEE Access, vol. 8, pp. 30697-30705, 2020.

M. T. Mohamed and P. J. Darrell, "Scalability challenges in cloud-based data

partitioning," International Journal of Distributed and Parallel Systems, vol. 12, no. 5,

pp. 56-67, 2021.

A. Singh, V. K. Jain, and R. Sharma, "Hybrid partitioning techniques for cloud

analytics," IEEE Transactions on Services Computing, vol. 15, no. 3, pp. 234-246,

H. Y. Lee and M. Y. Chen, "Efficient partitioning strategies for real-time cloud-based

analytics," IEEE Transactions on Parallel and Distributed Systems, vol. 31, no. 8, pp.

-1686, 2020.

B. Lee, W. Zhang, and H. Kim, "Adaptive data partitioning with machine learning for

cloud environments," Journal of Cloud Computing and Big Data Analytics, vol. 7, no.

, pp. 99-110, 2021.

P. Gupta, S. Agarwal, and R. Soni, "Cost-based strategies for optimizing data

partitioning in distributed cloud environments," Cloud Computing: Theory and

Practice, vol. 5, no. 3, pp. 45-55, 2020.

Z. Lu and T. Wu, "Evaluating partition pruning techniques in distributed systems,"

IEEE Transactions on Network and Service Management, vol. 18, no. 2, pp. 234-245,

A. Bose, S. Kumar, and M. Mehta, "Efficient query execution in distributed cloud

systems using intelligent partitioning strategies," ACM Computing Surveys, vol. 53,

no. 5, pp. 1-25, 2020.

J. V. P. Jadhav and T. K. Peddabachagari, "Data distribution and partitioning for cloud

data warehouses," IEEE Transactions on Cloud Computing, vol. 6, no. 4, pp. 992-

, 2019.

P. K. Sharma and A. K. Jain, "Optimizing data partitioning strategies for distributed

cloud systems," International Journal of Advanced Computer Science and

Applications, vol. 11, no. 2, pp. 123-135, 2020.

L. Zhang, H. S. Rao, and J. Liu, "Efficient query optimization techniques for large-

scale distributed systems," IEEE Access, vol. 9, pp. 6712-6723, 2021.

W. M. Ko, A. P. Zhang, and L. R. Wong, "Data partitioning for distributed database

systems in cloud environments," Journal of Computer Science and Technology, vol.

, no. 1, pp. 58-72, 2020.

C. S. Goh and K. H. Lee, "Evaluating hybrid cloud data partitioning techniques for

query performance," Journal of Cloud Computing and Distributed Systems, vol. 13,

no. 4, pp. 45-59, 2020.

S. H. Kim, E. P. Kim, and T. Y. Hong, "Scalable data partitioning for cloud-based big

data analytics," Journal of Supercomputing, vol. 77, no. 3, pp. 853-871, 2021.

Downloads

Published

23-08-2023

How to Cite

[1]
Feroskhan Hasenkhan, Arun Ayilliath Keezhadath, and Lalitha Amarapalli, “Intelligent Data Partitioning for Distributed Cloud Analytics”, Newark J. Hum. Centric AI Robot Inter., vol. 3, pp. 106–145, Aug. 2023, Accessed: Feb. 16, 2026. [Online]. Available: https://njhcair.org/index.php/publication/article/view/17