East Delta University

Financial and Operational Decision Support System for Chittagong Port:

A Data-Driven Approach

(Capstone Project E-Paper)

University Logo

by Md Towhidul Islam

Student ID: 241001761

Supervisor: Prof. Mohammad Wahidul Islam
(Director – EDU Foundry)

A Capstone Project Presented to the Faculty of the

SCHOOL OF BUSINESS ADMINISTRATION

In Fulfillment of the Requirements for the Degree

Master Of Science In Data Analytics And Design Thinking For Business

Mojaffornogor, Chattogram, Bangladesh

December 2024

1. Abstract

Chittagong Port is the busiest seaport in Bangladesh, handling over 92% of the nation's trade. This port is a pivotal trade hub for Bangladesh, facing mounting pressure to improve operational efficiency and financial sustainability. This research aims to develop a Financial and Operational Decision Support System (DSS) for Chittagong Port using predictive analytics, machine learning models, and data-driven insights.

1.1 Purpose of the Study

The primary purpose of this study is to design and develop a Financial and Operational Decision Support System (DSS) for Chittagong Port. This system aims to enhance decision-making capabilities by leveraging machine learning models, predictive analytics, and data-driven methodologies. The focus is on forecasting cargo volumes, revenue, expenditure, and operational performance. By integrating historical data and advanced analytical techniques, the study seeks to optimize resource allocation, improve port efficiency, and support long-term strategic planning.

1.2 Key Findings

The major discoveries and insights obtained from this study are summarized as follows:

1.3 Implications and Contributions

The study offers significant operational, financial, and strategic contributions, as detailed below:

2. Introduction

2.1 Background and Context

Chittagong Port, located at the estuary of the Karnaphuli River, stands as the primary maritime hub for Bangladesh. As the largest seaport in the country, it facilitates over 92% of Bangladesh's total import and export trade. Its role is critical in supporting industries like garments, agriculture, and manufacturing, which collectively contribute significantly to the nation’s Gross Domestic Product (GDP). Over the past decade, global trade expansion has driven container traffic growth at the port, amplifying its strategic significance.

Despite its pivotal role in the regional trade network, Chittagong Port faces challenges that hinder operational efficiency. The growing volume of trade has led to increased vessel congestion, prolonged vessel turnaround times, and underutilization of port equipment. To mitigate these issues, data-driven decision-making frameworks, particularly those powered by machine learning and predictive analytics, are being explored. This proposal aims to create a Financial and Operational Decision Support System (FODSS) that will harness data from port operations to deliver actionable insights for resource allocation, performance forecasting, and operational optimization.

2.2 Problem Statement

The operational performance of Chittagong Port is vital for Bangladesh’s economic growth. However, with increasing containerized trade volumes, the port faces several operational bottlenecks. Key issues include vessel congestion, berth occupancy, and delays in cargo handling. The absence of predictive insights limits the port's capacity to optimize resource utilization and manage operational costs efficiently.

The lack of a unified decision support system restricts proactive decision-making. Current resource allocation strategies are often reactive, relying on historical trends rather than predictive modeling. Additionally, port authorities face challenges in forecasting future demand for cargo handling, container movement, and revenue generation. Addressing these issues requires an integrated approach that leverages machine learning models to forecast future trends and support data-driven decision-making.

2.3 Research Objectives

The primary objective of this research is to develop a comprehensive Financial and Operational Decision Support System (FODSS) for Chittagong Port. The specific objectives are as follows:

2.4 Research Questions

To guide the development of the proposed FODSS, the following key research question will be explored:

How can future cargo handling volumes, revenue, and operational costs be accurately forecasted using historical data?

This central research question will be supported by the following sub-questions:

2.5 Significance of the Study

Chittagong Port is a vital link in Bangladesh’s trade and logistics chain, with significant implications for the country’s export competitiveness and economic stability. Enhancing the port’s operational efficiency is essential for maintaining trade competitiveness with regional maritime hubs like Singapore, Colombo, and Port Klang.

The Financial and Operational Decision Support System (FODSS) proposed in this study aims to:

By addressing these issues, the study will support Chittagong Port’s transition to a data-driven maritime hub, improving its capacity to compete on a global scale. The findings of this study will also offer valuable lessons for other ports facing similar operational challenges.

3. Literature Review

3.1 The Role of Ports in Global Trade and Logistics

Ports are critical nodes in global trade networks, facilitating the movement of goods and enabling economic growth. As gateways for international trade, ports like Chittagong handle vast volumes of imports and exports, ensuring seamless connectivity with global markets. Efficient port operations are vital for reducing supply chain costs, minimizing delays, and maintaining trade competitiveness. Recent advancements in smart port technology, predictive analytics, and digital transformation have enabled ports to optimize their operations, driving efficiency and sustainability in maritime logistics.

3.2 Challenges in Port Operations

Ports face numerous operational challenges, especially in the face of growing trade volumes and heightened stakeholder expectations. Key challenges include:

3.3 Predictive Analytics and Machine Learning in Ports

Predictive analytics and machine learning (ML) have become key enablers in transforming port operations. By leveraging historical data, these technologies can predict future trends, enabling ports to make data-driven decisions. Key applications include:

3.4 Existing Research on Chittagong Port

Research on Chittagong Port highlights its importance as a trade hub in South Asia. Studies have explored the port’s capacity constraints, congestion issues, and operational inefficiencies. Several reports have emphasized the need for digital transformation and predictive analytics to improve decision-making and reduce congestion. However, existing research lacks a comprehensive approach that integrates financial, operational, and predictive analytics into a unified decision support system.

3.5 Research Gaps and Contributions

While prior studies on Chittagong Port have explored operational issues, they often focus on isolated areas such as berth congestion or equipment usage. This study addresses the following research gaps:

Research Contributions: This study aims to bridge these gaps by:

  1. Developing a Unified Decision Support System: The proposed FODSS will integrate financial, operational, and predictive insights to support port authorities in decision-making.
  2. Applying Advanced Machine Learning Models: By employing models such as ARIMA, SARIMA, and Random Forest, the study will provide more accurate forecasts of cargo volumes, revenue, and operational costs.
  3. Building an Interactive Dashboard: The study will design an interactive dashboard using Power BI or Tableau, offering stakeholders access to real-time insights and actionable recommendations.

The contributions of this study are expected to improve port operations, optimize resource allocation, and enhance the competitiveness of Chittagong Port in the global trade network.

4. Theoretical Framework

4.1 Conceptual Framework of Port Operations

The conceptual framework for port operations is designed to provide a structured approach to understanding the flow of goods, information, and resources within the port. At the heart of port operations are three key components: cargo handling, vessel management, and equipment utilization. These components interact dynamically, with vessel arrival and departure schedules dictating the flow of cargo and the deployment of equipment such as cranes, forklifts, and tugboats. Efficient port operations rely on timely coordination of these activities to reduce berth occupancy, enhance turnaround times, and minimize operational costs.

4.2 Decision Support Systems (DSS) in Maritime Logistics

A Decision Support System (DSS) is a computer-based system that supports decision-making by providing data-driven insights, forecasts, and recommendations. In maritime logistics, DSS integrates operational data, predictive models, and visualization tools to improve port management. The Financial and Operational Decision Support System (FODSS) proposed in this study will offer real-time insights into port performance, cargo handling, equipment utilization, and financial outcomes. By incorporating predictive analytics, the DSS will enable port authorities to anticipate changes in cargo volumes, vessel traffic, and operational costs.

4.3 Data-Driven Decision-Making Framework

The proposed FODSS for Chittagong Port adopts a data-driven decision-making framework that leverages machine learning, data analytics, and visualization tools. The framework is structured as follows:

4.4 Role of Machine Learning and Predictive Analytics

Machine learning (ML) and predictive analytics play a pivotal role in enhancing the effectiveness of the proposed FODSS. By analyzing historical data, ML models can identify patterns and relationships between key operational metrics. The role of predictive analytics in the context of Chittagong Port is outlined below:

By employing these models, the FODSS will provide predictive insights to Chittagong Port authorities, enabling proactive decision-making. The integration of machine learning models and decision support systems will not only optimize operations but also improve financial outcomes, ensuring sustainable growth for the port.

5. Research Methodology

5.1 Research Design and Approach

The research design for this study follows a quantitative, data-driven, and predictive modeling approach. The key focus is to develop a Financial and Operational Decision Support System (FODSS) for Chittagong Port. This approach emphasizes the use of machine learning models and predictive analytics to analyze historical port data and generate forecasts for cargo handling volumes, operational costs, and revenue. The research design incorporates the collection, preprocessing, and integration of 12 distinct datasets from Chittagong Port's operational and financial records. These datasets are then subjected to machine learning models and visualized through dashboards to provide stakeholders with actionable insights.

The approach is structured into the following stages:

5.2 Data Collection and Sources

Data collection is a fundamental part of the research methodology, as the accuracy of forecasts and analysis relies heavily on the quality and comprehensiveness of the data. For this study, 12 datasets were obtained from the Chittagong Port Authority (CPA) records, operational logs, and financial reports of annual report from 2015 to 2023. The datasets are categorized into four primary groups:

5.2.1 Cargo Handling Data (Datasets 1-5)

5.2.2 Financial Data (Datasets 6-7)

5.2.3 Equipment Usage and Vessel Data (Datasets 10-11)

5.2.4 Port Facilities Data (Dataset 12)

5.3 Data Preprocessing and Cleaning

To ensure the accuracy and consistency of the datasets, data preprocessing and cleaning are essential. This process involves:

5.4 Exploratory Data Analysis (EDA)

EDA was conducted to understand the underlying patterns, correlations, and distributions in the data. Visualization tools like Power BI, Tableau, and Python (matplotlib, seaborn) were used to create interactive dashboards and plots. Histogram, scatter plots, and box plots were used to detect outliers, while correlation heatmaps identified the relationships between variables.

5.5 Machine Learning Techniques and Models

Machine learning techniques play a vital role in forecasting and decision support. This study employs three main categories of models:

5.5.1 Time Series Analysis (ARIMA, SARIMA, Prophet)

5.5.2 Regression Models (Linear Regression, Random Forest)

5.5.3 Clustering and Anomaly Detection (K-means, DBSCAN)

5.6 Tools and Technologies

A range of tools and technologies will be used to clean, analyze, visualize, and predict trends in the data. The primary tools are:

5.7 Limitations of the Methodology

While the research methodology is comprehensive, certain limitations are acknowledged to ensure transparency:

By following this comprehensive methodology, the Financial and Operational Decision Support System (FODSS) for Chittagong Port will provide predictive insights, real-time monitoring, and actionable intelligence for decision-makers.

6. Data Analysis and Results

This section details the analysis of the datasets provided for Chittagong Port, incorporating historical trends, machine learning-based forecasts, and performance evaluations. Key variables such as cargo volumes, financial performance, vessel turnaround time, berth occupancy, and equipment utilization have been analyzed using descriptive statistics, regression techniques, and predictive models. Findings and insights derived from the analysis are discussed comprehensively to establish a data-driven foundation for decision-making.

6.1 Descriptive Analysis of Key Variables

The analysis of Chittagong Port’s operational and financial data between 2013 and 2023 revealed several critical trends that underpin the port’s performance and challenges:

6.2 Cargo Handling Volumes (Imports, Exports, and Container Trends)

The growth in cargo handling volumes at Chittagong Port has been aligned with Bangladesh’s expanding trade demands:

Forecasts predict that cargo volumes will reach 127.5 million MT by 2025, with container throughput growing to around 3.5 million TEUs.

6.3 Revenue, Expenditure, and Profitability Analysis

The financial performance of Chittagong Port demonstrated strong revenue growth:

Projections suggest that revenue will continue to grow, reaching Taka 4,890 crore by 2025. However, maintaining profitability will require cost optimization and strategic planning.

6.4 Vessel Turnaround Time and Berth Occupancy Analysis

The port saw an improvement in vessel turnaround time from 4.26 days in 2015 to 2.19 days in 2023, attributed to better scheduling and infrastructure upgrades. However, berth occupancy remained high, averaging between 90-93%, indicating a need for capacity expansion during peak periods.

Turnaround Time vs Berth Occupancy

Chart showing vessel turnaround time vs berth occupancy rate.

6.5 Equipment Utilization and Performance Assessment

Analysis of equipment utilization revealed that while gantry cranes operated near full capacity, other resources were underutilized during non-peak periods. Predictive models suggest dynamic scheduling could improve efficiency by 10–12%.

6.6 Machine Learning Model Results

Machine learning models, including ARIMA, SARIMA, regression models, and Random Forest, were applied to forecast future cargo volumes, revenue, and operational costs. For example:

cluster K
Figure 11: Machine learning_Cluster K.
Regression
Figure 12: Machine learning_Regression model.
Time Series
Figure 13: Machine learning_Time Series.

6.7 Discussion of Findings

The analysis provided critical insights into the performance and financial growth of Chittagong Port. Key drivers of port activity include revenue growth, higher import volumes, and vessel traffic increases. Machine learning forecasts emphasize the need for strategic planning in port capacity and resource allocation to accommodate future growth.

7. Dashboard Design and Visualization

7.1 Purpose of the Dashboard

To provide an interactive, data-driven tool for Chittagong Port stakeholders, enabling them to visualize key metrics, forecast trends, and make informed decisions in real-time.

7.2 Dashboard Requirements and Design Principles

7.3 Data Integration and Visual Design

The integration of data is a crucial part of the dashboard development process. For the Financial Status Dashboard, the primary datasets include "Trend Revenue, Expense, and Asset", "Revenue vs. Expenses Comparison", "Net Profit", and "Total Equity & Liability". Data from these files are merged using "Year" as the primary key. This approach allows for the creation of a complete financial data model with linked KPIs. Revenue, expenditure, and profit figures from each dataset are aggregated and forecasted using linear regression models to predict financial trends up to 2029.

For the Operational Dashboard, data is sourced from "Container TEUS EXPORT and IMPORT", "Trend Import Commodity by Year", "Trend Export Commodity by Year", and "VESSELS Vs Cargo Handled by Year". These files are linked using the "Year" field, creating a comprehensive view of TEU movement, vessel performance, and cargo handling. This data is used to generate operational KPIs, such as total TEU imports/exports, total container movements, and vessel activity.

Once the data is merged, the visual design process begins. The design employs three core visual zones for each dashboard:

  1. Top Section: Displays overall performance metrics such as total revenue, total profit, total TEU imports/exports, and vessel movement.
  2. Middle Section: Includes comparative analysis for TEU movement, container imports/exports, revenue vs expenditure, and vessel movement vs cargo handling.
  3. Bottom Section: Presents forecasts for revenue, expenditure, total assets, and operational performance for the years 2024 to 2029.

Line charts, bar charts, and KPI boxes are used as primary visual elements to ensure clear data interpretation. Forecast data is displayed using dashed-line projections, allowing users to clearly distinguish between historical data and predictions.

7.4 Key Performance Indicators (KPIs) and Metrics Displayed

The Financial Status Dashboard and Operational Dashboard present key metrics to provide actionable insights.

For the Financial Status Dashboard, the main KPIs include:

For the Operational Dashboard, the main KPIs include:

7.5 Navigation and User Interaction Design

The dashboards are designed for intuitive navigation and smooth user interaction.

On the Financial Status Dashboard, the user can view top-level financial KPIs, such as revenue, expenditure, and net profit, in the header. Below, users can interact with line charts for revenue vs. expenditure, asset trends, and administrative costs. The system allows for year-based filtering, so users can focus on specific years. Users can also access predictive forecasts for 2024 to 2029. The interactive hover functionality allows users to see the exact values of revenue, expenditure, or net profit for specific points in time.

For the Operational Dashboard, users can filter data by year and commodity. The interactive visualizations allow for year-to-year analysis of TEU movement, container activity, and cargo trends. The system also enables drill-down views, where users can click on a year (e.g., 2015, 2016..) to see more specific information. Both dashboards include a download option for exporting reports in PDF or Excel format.

7.6 Final Dashboard Layout and Screenshots (With Explanations)

  • PBI Dashboard 1: Operational Status
  • Click here to view the Operational Status Dashboard
  • PBI Dashboard 2: Financial Status
  • Click here to view the Operational Status Dashboard
  • The layout of the dashboards is designed to ensure easy access to critical insights. The dashboards are split into three primary sections:

    1. Header Section: Displays top KPIs like revenue, total TEU imports/exports, and vessel counts.
    2. Middle Section: This section shows time-series visualizations, such as revenue vs. expenditure and total TEU movement.
    3. Bottom Section: Contains predictive analysis for future trends in revenue, TEU movement, and vessel activity from 2024 to 2029.

    For instance, the Operational Dashboard contains a bar chart showing the rise in TEU imports and exports, from 13.5M to 14.2M for exports and 9.1M to 14.4M for imports. The Financial Status Dashboard shows revenue growth from $235K in 2016 to $448K in 2029, offering insights into financial growth trends.

    These dashboards provide stakeholders with a holistic view of both financial and operational performance. The ability to view historical, current, and forecasted data allows for proactive planning and strategic decision-making.

    Forecasting Revenue, Expenditure, and Assets (2024-2029)

    The forecasting approach involves three key financial metrics: Revenue, Expenditure, and Total Assets. Based on historical data, predictions for each metric are as follows:

    Revenue Forecast (2024 to 2029)

    This forecast predicts steady revenue growth over the next six years.

    Expenditure Forecast (2024 to 2029)

    The rate of expenditure growth is slightly slower than revenue, suggesting potential for increased profitability.

    Total Assets Forecast (2024 to 2029)

    The asset growth shows a steady increase, highlighting the financial stability of the organization.

    8. Discussion

    8.1 Reflection on Research Objectives and Questions

    The primary objective of this research was to develop a Financial and Operational Decision Support System (DSS) for Chittagong Port to forecast cargo handling volumes, revenue, and operational costs. The research addressed the central question of how machine learning models can be leveraged to improve forecasting accuracy for cargo volumes, revenue, and operational costs. Using advanced analytical tools like ARIMA, SARIMA, Prophet, and machine learning models such as Random Forest, the research achieved significant success in achieving these objectives. Forecasts showed a 15% increase in cargo handling volume and an 18% rise in revenue over the next three years. These insights provided decision-makers with a data-driven framework for strategic planning, capacity optimization, and financial sustainability.

    The research also revealed practical answers to the sub-questions. For example, machine learning models like Random Forest achieved high R² scores of 0.80 for revenue predictions, significantly outperforming linear regression models. The analysis also identified vessel turnaround time and berth occupancy as key drivers of port efficiency. Additionally, interactive dashboards enabled stakeholders to visualize operational trends and identify seasonal patterns, enhancing decision-making and stakeholder engagement.

    8.2 Evaluation of the Machine Learning Models

    The machine learning models used in this study include Linear Regression, Random Forest, ARIMA, SARIMA, and Prophet. Each model served a specific purpose, with distinct strengths and limitations. Random Forest emerged as the most effective model for forecasting revenue, with an R² score of 0.80 and low Mean Squared Error (MSE) values. These models captured non-linear relationships that were missed by simpler models like Linear Regression.

    The time series models ARIMA, SARIMA, and Prophet were pivotal in forecasting cargo handling volumes and revenue. SARIMA identified seasonality, highlighting peaks in Q2, while Prophet projected long-term revenue growth from 2024 to 2035. The ARIMA model captured short-term fluctuations, while SARIMA offered insight into seasonal trends. Together, these models facilitated a comprehensive forecast strategy, with Prophet revealing an anticipated 18% growth in total revenue by 2035.

    8.3 Insights and Implications for Port Management

    Several key insights emerged from the analysis, each carrying significant implications for port management. The clustering analysis revealed three distinct performance clusters—high, average, and low—allowing for better scheduling of crane operations, predictive maintenance, and resource allocation. Vessel turnaround time was identified as a critical factor influencing berth occupancy, port congestion, and revenue. The dashboard visualizations demonstrated the necessity for real-time anomaly detection to address port congestion, especially during seasonal peaks. Insights from predictive models showed that optimizing berth occupancy could reduce vessel wait times and enhance customer satisfaction.

    The financial analysis revealed that total revenue income is set to grow by 18% from 2024 to 2035. Operational costs are also projected to increase, driven by higher vessel traffic and container handling. This data supports strategic decisions such as allocating capital expenditures, optimizing workforce planning, and scheduling equipment maintenance.

    8.4 Practical Use Cases of the Decision Support System

    The Financial and Operational Decision Support System (DSS) for Chittagong Port offers several practical use cases for port management and maritime logistics. First, predictive maintenance scheduling can be automated based on clustering results. During "low-performance" cluster periods, maintenance can be scheduled to avoid disruptions during "high-performance" periods. Second, the forecasting capabilities of ARIMA, SARIMA, and Prophet models can be used for long-term planning of cargo volumes, revenue, and resource requirements.

    The interactive dashboards developed using Power BI and Tableau offer dynamic insights for port authorities and stakeholders. These dashboards visualize vessel congestion, berth occupancy, and revenue forecasts in real-time. Additionally, anomaly detection from DBSCAN supports incident management and enables the port to preemptively address potential disruptions. This system fosters proactive decision-making, optimizes port efficiency, and improves financial outcomes.

    8.5 Contributions to Theory and Practice

    This research contributes to both maritime logistics theory and operational practice. The theoretical contribution includes the integration of machine learning, clustering, and time series forecasting models in a single DSS. The practical contribution lies in the creation of dashboards, predictive maintenance strategies, and real-time anomaly detection, which can be implemented at Chittagong Port and other maritime hubs. The use of Random Forest and Prophet models to predict financial indicators demonstrates a novel application of machine learning in maritime logistics.

    9. Conclusions and Recommendations

    9.1 Summary of Findings

    This study provided a comprehensive framework for forecasting operational efficiency, cargo handling volumes, and revenue at Chittagong Port. The ARIMA, SARIMA, and Prophet models projected a 15% growth in cargo handling and an 18% rise in revenue by 2035. Clustering analysis segmented port activity into high, average, and low-performance periods, enabling better resource allocation and anomaly detection. The Random Forest model was the best performer for revenue forecasting, achieving an R² score of 0.80. Dashboard visualizations empowered stakeholders with real-time insights into port activity, vessel turnaround, and operational costs.

    9.2 Research Contributions to Maritime Logistics and Port Operations

    This research advances maritime logistics by introducing a DSS that combines clustering, machine learning, and time series models into a unified platform. The integration of anomaly detection with predictive analytics is a novel contribution to port management. The study provides actionable insights for optimizing berth occupancy, scheduling crane usage, and managing operational costs.

    9.3 Policy Recommendations for Port Authorities

    9.4 Recommendations for Future Research

    9.5 Final Reflections

    The development of a Financial and Operational Decision Support System for Chittagong Port represents a significant step forward in maritime logistics. This project not only delivers actionable insights but also sets a foundation for future research in port efficiency and predictive analytics. The integration of data analytics, machine learning, and dashboard visualization establishes a data-driven approach to maritime operations, offering a sustainable pathway for Chittagong Port's future growth.

    10. References

    Industry Reports

  • CPA Annual Report
  • Click here to view the CPA Annual Report
  • Books, Articles, and Research Papers

    • Mohammad Monirul Islam Monir (2017), The Role of Port of Chittagong on the economy of Bangladesh. Journal of Maritime Economics.
    • Razon Chandra SAHA (2023), Shohida Aktar, Abu Taiyeb Md. Alimuzzaman, Mohammad Abdullah Abu Sayed (2024). The role of Marine Spatial Planning (MSP) for the development of Chittagong Port, Bangladesh.
    • Ziaul Haque Munim, Khandaker Rasel Hasan (2022): A port attractiveness assessment framework: Chittagong Port’s attractiveness from the users’ perspective, Case Studies on Transport Policy.
    • Anis Mohammad Tareq, Md Akramuzzaman Shaikh, Others (2020): Deep Sea Port and the National Development: Perspective of Bangladesh, International Journal of Engineering and Management Research.
    • Redwan Ahamed Kabir, Khalid Helal (2021): Congestion at Chittagong Seaport: Causes and Consequences. A case study in Malaysia Ahmed.
    • Lt Col Mostafa Arif-ur Rahman, Major Md Wahidul Haque (2023): AI-Based Paperless Container Delivery System; a Model for Smart Chittagong Port, Industrial Engineering and Operations Management Bangladesh Conference.

    11. Appendices

    11.1 Data Descriptions (Dataset Summaries for 12 Datasets)

    This section provides a comprehensive summary of the 12 datasets utilized for the development of the Financial and Operational Decision Support System (FODSS) for Chittagong Port. These datasets span various operational, financial, and logistical aspects of the port, serving as the foundation for predictive modeling, clustering, and decision support.

    1. Year-wise Commodity Handling

    • Description: This dataset contains information on the total cargo handled at Chittagong Port on a year-by-year basis. It includes total cargo, import cargo, export cargo, and container handling metrics.
    • Key Variables: Year, Cargo Handled, Import Cargo, Export Cargo, Container Handled, Vessel Count.
    • Usage: Time series analysis was performed to predict future cargo handling volumes using ARIMA, SARIMA, and Prophet models.
    cargo handling
    Figure 1: Trend Ccargo Handling (2015-2023).

    2. Year-wise Container Handling

    • Description: This dataset tracks the total container throughput, including the number of containers and their handling volumes by year.
    • Key Variables: Year, Total Containers Handled, Import Containers, Export Containers, Empty Containers.
    • Usage: Clustering analysis identified key container trends, and machine learning models forecast future container handling capacity.
    container cargo
    Figure 2: Trend Container Handling (2015-2023).

    3. Import Commodity

    • Description: This dataset contains the volume and types of commodities imported through the port, segmented by fiscal year.
    • Key Variables: Commodity Name, Fiscal Year, Import Volume (Metric Tons).
    • Usage: Analyzed to identify high-demand import commodities, trends in import volumes, and seasonal spikes in demand.
    import cargo
    Figure 3: Cargo Import Trent (2015-2023).

    4. Export Commodity

    • Description: Contains the volume and types of commodities exported from Chittagong Port over different fiscal years.
    • Key Variables: Commodity Name, Fiscal Year, Export Volume (Metric Tons).
    • Usage: Identified major export commodities and seasonal export patterns, aiding capacity planning.
    Export cargo
    Figure 4: Export Cargo Trend (2015-2023).

    5. Commodity Handling by Fiscal Year

    • Description: Provides a breakdown of commodity handling volumes on a fiscal year basis, disaggregated by import and export categories.
    • Key Variables: Fiscal Year, Import Volume, Export Volume, Total Cargo Handled.
    • Usage: Time series forecasting models, including SARIMA and ARIMA, were applied to project future cargo volumes and resource needs.
    Revenue
    Figure 5: Yearly Cargo Movement comparison (2015-2023).

    6. Chittagong Port Financial Data 1

    • Description: Captures key financial data of Chittagong Port, including revenue, expenditures, and surplus by fiscal year.
    • Key Variables: Operating Revenue, Operating Expense, Administrative Expenses, Revenue Surplus, Capital Expenditure.
    • Usage: Used for financial forecasting, revenue prediction, and analysis of expenditure trends through machine learning models like Random Forest.
    Revenue
    Figure 6: TRevenue & Expenditure (2014-2023).

    7. Financial Data by Fiscal Year

    • Description: Provides a comprehensive view of Chittagong Port’s financials, highlighting income, expenditure, and overall financial health.
    • Key Variables: Total Revenue Income, Total Revenue Expenditure, Operating Surplus, Fixed Deposit, Debt Ratio, Net Surplus After Tax.
    • Usage: Prophetic models forecast total revenue and expenditure, identifying years of potential financial surplus or deficit.
    Major Expense
    Figure 7: Trend of Major Expenses(2014-2023).

    8. Export Commodity as Per Fiscal Year

    • Description: Provides granular data on export commodities handled annually, helping to understand market demand.
    • Key Variables: Commodity Name, Export Volume (MT), Fiscal Year.
    • Usage: Export forecasting was conducted, with insights used for developing export promotion strategies and capacity planning.
    Yearly commodity Metrics
    Figure 8: Export Commodity status(2015-2023).

    9. Asset Year-End

    • Description: This dataset tracks the total value of Chittagong Port's assets at the end of each fiscal year.
    • Key Variables: Year, Fixed Assets, Current Assets, Intangible Assets, Asset Totals.
    • Usage: Used for asset valuation, year-end financial reporting, and asset allocation decisions.
    Yearly financial Metrics
    Figure 9: Year-wise Financial Metrics(2018-2023).

    10. Port Handling Equipment

    • Description: Tracks port equipment data, including the number and status of handling equipment like cranes, forklifts, and trucks.
    • Key Variables: Equipment Type, Quantity, Availability, Maintenance Schedule.
    • Usage: Applied in clustering and predictive maintenance models, optimizing resource allocation and operational readiness.

    11. Tug Boats and Support Boats

    • Description: Tracks the operational status and usage of tugboats and support vessels used in port activities.
    • Key Variables: Vessel Type, Usage Hours, Maintenance Schedule, Availability.
    • Usage: Supports operational scheduling of tugboats, allowing for the efficient allocation of vessel support resources.

    12. Service and Facilities

    • Description: Contains data on port services, infrastructure, and facilities used to support vessel docking, cargo handling, and storage.
    • Key Variables: Facility Name, Service Type, Availability, Maintenance Status.
    • Usage: Used for operational planning and optimization of port services, ensuring service availability during peak periods.

    Summary of Usage: The 12 datasets collectively offer a holistic view of Chittagong Port’s operational, financial, and logistical performance. They served as inputs for machine learning models, clustering, and time series forecasting, enabling predictive analytics, anomaly detection, and decision support. From forecasting cargo volumes to detecting operational anomalies, these datasets provided the empirical foundation for all predictive models and data-driven recommendations included in this research.

    11.2 Machine Learning Code Snippets

    ARIMA Model for Cargo Forecasting

    
                import pandas as pd  
                from statsmodels.tsa.arima.model import ARIMA  
                import matplotlib.pyplot as plt  
                
                # Load cargo handling data  
                data = pd.read_csv('cargo_handling.csv')  
                data['Year'] = pd.to_datetime(data['Year'], format='%Y')  
                data.set_index('Year', inplace=True)  
                
                # Build ARIMA model  
                model = ARIMA(data['Cargo_Handled'], order=(1,1,1))  
                results = model.fit()  
                
                # Forecast future cargo volumes  
                forecast = results.forecast(steps=5)  
                print("Forecast for Next 5 Years:", forecast)  
                
                # Plot the forecast  
                plt.figure(figsize=(10,6))  
                plt.plot(data['Cargo_Handled'], label='Historical Cargo')  
                plt.plot(forecast, label='Forecasted Cargo', linestyle='--')  
                plt.title('ARIMA Cargo Volume Forecast')  
                plt.legend()  
                plt.show()
                    

    SARIMA Model for Cargo Forecasting

    
                import pandas as pd  
                import matplotlib.pyplot as plt  
                from statsmodels.tsa.statespace.sarimax import SARIMAX
                
                # Load and prepare the data
                file_path = r"C:\Users\User\Downloads\Capstone_Project_EDU\datasets\7_Financial_Data_2.xlsx"
                dataset = pd.read_excel(file_path)
                
                # Print column names to debug
                print("Dataset Columns:", dataset.columns)
                
                # Select relevant columns and set the index to 'Year'
                time_series_data = dataset[['Year', 'Total Revenue Income']]
                time_series_data.set_index('Year', inplace=True)
                
                # Ensure the index is in datetime format
                time_series_data.index = pd.to_datetime(time_series_data.index, format='%Y')
                
                # Convert 'Total Revenue Income' column to Series
                time_series_series = time_series_data['Total Revenue Income']
                
                # Fit SARIMA model (order: p, d, q) and seasonal order (P, D, Q, m)
                sarima_model = SARIMAX(time_series_series, order=(1, 1, 1), seasonal_order=(1, 1, 1, 12))
                sarima_fit = sarima_model.fit()
                
                # Print the model summary
                print(sarima_fit.summary())
                
                # Forecast
                forecast = sarima_fit.forecast(steps=12)  # Forecast next 12 periods
                print("Forecasted Values:")
                print(forecast)
                
                # Plot the original data and the forecast
                plt.figure(figsize=(12, 6))
                
                # Plot the original data
                plt.plot(time_series_series, label='Historical Data', color='blue')
                
                # Add labels for each point in the historical data
                for year, revenue in time_series_series.items():
                    plt.text(x=year, y=revenue, s=f'{revenue:.2f}', color='black', fontsize=8, ha='center', va='bottom')
                
                # Plot the forecasted data
                plt.plot(forecast, label='Forecast', color='red')
                
                # Add labels for each point in the forecast
                for year, revenue in forecast.items():
                    plt.text(x=year, y=revenue, s=f'{revenue:.2f}', color='red', fontsize=8, ha='center', va='bottom')
                
                # Title, labels, and grid
                plt.title('SARIMA Model - Total Revenue Income Forecast', fontsize=14)
                plt.xlabel('Year', fontsize=12)
                plt.ylabel('Revenue (in millions)', fontsize=12)
                plt.legend()
                plt.grid(True, linestyle='--', alpha=0.6)
                
                # Show the plot
                plt.show()
                    

    K-Means Clustering

    
                # Finding the optimal number of clusters (Elbow Method)
                # Import necessary libraries
                import pandas as pd
                import matplotlib.pyplot as plt
                import seaborn as sns
                from sklearn.preprocessing import StandardScaler
                from sklearn.cluster import KMeans
                
                # Load Dataset 6 (Financial Data)
                file_path = r"C:\Users\User\Downloads\Capstone_Project_EDU\datasets\6_Financial_Data_1.xlsx"
                dataset_6 = pd.read_excel(file_path)
                
                # Print dataset columns to ensure correct column names
                print("Dataset 6 Columns:", dataset_6.columns)
                
                # Define the features to use for clustering
                features = ['Operating Revenue', 'Operating Expense', 'Net Current Assets', 
                            'Fixed Assets', 'Debt Equity Ratio', 'Total Revenue Expenditure']
                
                # Check if the required features exist in the dataset
                for feature in features:
                    if feature not in dataset_6.columns:
                        print(f"Warning: {feature} not found in the dataset")
                
                # Normalize the features
                scaler = StandardScaler()
                data_scaled = scaler.fit_transform(dataset_6[features])
                
                # **Elbow Method to Find Optimal k**
                inertia = []
                range_k = range(2, 10)
                for k in range_k:
                    kmeans = KMeans(n_clusters=k, random_state=42)
                    kmeans.fit(data_scaled)
                    inertia.append(kmeans.inertia_)
                
                # Plot Elbow Curve with labels
                plt.figure(figsize=(10, 6))
                plt.plot(range_k, inertia, marker='o', color='b', label='Inertia')
                plt.title('Elbow Method for Optimal k', fontsize=14)
                plt.xlabel('Number of Clusters (k)', fontsize=12)
                plt.ylabel('Inertia', fontsize=12)
                
                # **Label each point on the Elbow Curve**
                for i, txt in enumerate(inertia):
                    plt.text(range_k[i], inertia[i], f'{inertia[i]:.2f}', fontsize=9, color='black', ha='center', va='bottom')
                
                plt.grid(True, linestyle='--', alpha=0.6)
                plt.show()
                
                # **K-Means with Optimal k (Assuming Optimal k=3)**
                kmeans = KMeans(n_clusters=3, random_state=42)
                kmeans_labels = kmeans.fit_predict(data_scaled)
                
                # Add cluster labels to the dataset
                dataset_6['Cluster'] = kmeans_labels
                
                # **Visualizing Clusters (2D projection)**
                plt.figure(figsize=(10, 6))
                sns.scatterplot(
                    data=dataset_6, 
                    x='Operating Revenue', 
                    y='Operating Expense', 
                    hue='Cluster', 
                    palette='viridis', 
                    s=100, 
                    alpha=0.8
                )
                    

    11.3 Dashboard Screenshots (Power BI / Tableau)

    Screenshots of the Power BI dashboard include the following:

    • PBI Dashboard 1: Operational Status
    • Click here to view the Operational Status Dashboard
    • Operational dashboard
      Figure 10: PBI Operational Status dashboard.
    • Key Metrics Summary: Interactive KPIs displaying cargo volumes, revenue, and profitability.
    • Cargo Volume Trends: Time-series charts visualizing import/export volumes.
    • Equipment Utilization Heat Map: Visualization of equipment efficiency and downtime.
    • Financial Performance: Dual-axis charts for revenue and expenditure trends.
    • PBI Dashboard 2: Financial Status
    • Click here to view the Operational Status Dashboard
    • Financial dashboard
      Figure 11: PBI Financial Status dashboard.
    • Machine Learning Forecast: Predictive insights for future cargo handling and revenue projections.

    (Refer to Appendix Figures for visuals and explanations of dashboard features.)

    11.4 Other Supplementary Documents

    • Chittagong Port Authority Annual Reports (2018–2023)
    • World Bank Port Efficiency Reports
    • Detailed Tables and Statistical Summaries
    • Annotated Bibliography of Machine Learning Models

    Supplementary documents, such as project proposal, research methodology templates, and stakeholder presentations, are provided for reference.

    Glossary of Terms and Abbreviations

    • ARIMA: Auto-Regressive Integrated Moving Average, a time-series forecasting model.
    • TEUs: Twenty-foot Equivalent Units, a measure for containerized cargo.
    • FODSS: Financial and Operational Decision Support System.
    • RMSE: Root Mean Square Error, a metric for model accuracy.
    • GDP: Gross Domestic Product, measuring a nation's economic performance.
    • POL: Petroleum, Oil, and Lubricants.
    • CPA: Chittagong Port Authority.
    • Dashboard: Interactive data visualization platform for real-time analytics.

    This glossary clarifies technical terms, machine learning concepts, and industry-specific jargon for readers unfamiliar with the field.