Pengembangan Sistem Pendukung Keputusan Manajemen Risiko Logistik E-Commerce Berbasis Machine Learning Menggunakan Random Forest Pipeline
DOI:
https://doi.org/10.71282/jurmie.v3i6.2259Keywords:
Logistics, Machine Learning, Data Mining, Streamlit, LogiTrackAbstract
The e-commerce logistics industry faces challenges due to the high risk of fleet delivery delays, where the analysis of risk mitigation and its financial impact is often limited for retail operational management. This project aims to develop an Integrated Logistics Risk Management Decision Support System Smart Platform named LogiTrack v2.0, which combines two main data engineering domains: Data Mining and Machine Learning. For data analysis, an artificial intelligence model was developed using the Random Forest Classifier algorithm within a Scikit-Learn Pipeline architecture, which detects minority class data via the class_weight='balanced' parameter to automatically predict cargo delay status based on the engineering of four composite features (Volume, Density, Shipping Fee per Gram, and Route). For smart interaction, binary file .pkl optimization was implemented using the Joblib library with compression level 3 to radically reduce storage capacity from 138.80 MB to 28.40 MB. All of these functionalities are integrated into a single interactive web application built using Streamlit and deployed on the Streamlit Cloud server. The dashboard is equipped with two main tools: a Guardrail Engine function to block package inputs exceeding retail capacity ($>30$ Kg) and a Business Impact Calculator panel to calculate the conversion of management financial penalties by 20% in real-time. Functional testing results via Black-Box Testing show that the classification model achieves a global accuracy rate of 90.00% and a precision value of 0.31, while the web platform successfully operates stably with low latency. This platform offers a holistic solution that enhances cargo distribution transparency, optimizes operational risk control efficiency, and suppresses potential financial losses within the logistics industry ecosystem.
Downloads
References
[1] M. K. Lim, Y. Li, C. Wang, and M. L. Tseng, "Evaluating the logistics performance of e-commerce companies in Indonesia: A customer-centric approach," Journal of Cleaner Production, vol. 313, p. 127926, Sep. 2021.
[2] G. L. R. de Oliveira, "Spatial and temporal analysis of e-commerce delivery performance using public datasets: A case study of Olist Brazil," International Journal of Logistics Research and Applications, vol. 25, no. 4, pp. 512-530, Apr. 2022.
[3] J. Wang, Y. Zhang, and X. Liu, "A predictive machine learning framework for supply chain delivery delay using gradient boosting and random forest," Transportation Research Part E: Logistics and Transportation Review, vol. 154, p. 102465, Oct. 2021.
[4] R. Fernandez, A. C. Lorena, and J. A. Olvera-Lopez, "Addressing extreme class imbalance in predictive maintenance and logistics log data using balanced ensemble classifiers," Expert Systems with Applications, vol. 182, p. 115204, Nov. 2021.
[5] S. Chopra and M. Singh, "Data-driven decision support systems in e-commerce logistics: From predictive analytics to operational strategies," Computers & Industrial Engineering, vol. 161, p. 107640, Dec. 2022.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Fetty Tri Anggraeny, Naufal Firman Dhani (Author)

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.










