1. Advana and Jupiter
Advana: DoD analytic environment
https://advana.data.mil/
        Databricks, DataRobot, Qlik, Tableau,
EDL, and Catalog
Jupiter: https://jupiter.data.mil/
        Navy’s cloud-based data lake and
analytics environment within ADVANA
        (ARES for the Army and
BLADE for the Air Force)
        AWS
GovCloud and operational on NIPR, SIPR
and JWICS networks
Service desk: https://advana.data.mil/landing/servicedesk
ADVANA knowledge: https://wiki.advana.data.mil/display/SDKB/Get+Help
Upcoming training: https://wiki.advana.data.mil/display/SDKB/Monthly+Training+Series
ADVANA training: https://wiki.advana.data.mil/display/SDKB/Get+Training
               https://wiki.advana.data.mil/display/SDKB/Beginner+Trainings
ADVANA office hours: https://wiki.advana.data.mil/display/SDKB/Office+Hours+Calendar
EDL for external data: https://edl.advana.data.mil/
Support team email: JupiterTechnicalSupport@us.navy.mil
2. Spatial data forecasting
NASA data: https://espo.nasa.gov/atom
The data downloaded from NASA is flight data to measure atmospheric
tomography mission(pollution environmental factors). Flights will occur
in each of 4 seasons over a 4-year period. The deployment originated
from the Armstrong Flight Research Center in Palmdale, California, flew
north to the western Arctic, south to the South Pacific, east to the
Atlantic, north to Greenland, and returned to California across central
North America.
Flight1 with 3380 rows. The flight starts from 14:33:10 and
ends at 23:56:30
Made the forecasting more challenging that the rows of 1660~1680 is
replaced as zeros and see if advanced models can predict the missing
rows in the middle of the flight
## # A tibble: 21 × 12
## Index FlightNum Date Start End Lat Long Alt Press Temp
## <chr> <dbl> <dbl> <time> <time> <dbl> <dbl> <dbl> <dbl> <dbl>
## 1 A1_01660 1 20160729 19:09:40 19:09:50 5.17 -121. 192. 992. 298.
## 2 A1_01661 1 20160729 19:09:50 19:10:00 5.15 -121. 193. 992. 298.
## 3 A1_01662 1 20160729 19:10:00 19:10:10 5.14 -121. 194. 992. 298.
## 4 A1_01663 1 20160729 19:10:10 19:10:20 5.13 -121. 195. 992. 298.
## 5 A1_01664 1 20160729 19:10:20 19:10:30 5.12 -121. 194. 992. 298.
## 6 A1_01665 1 20160729 19:10:30 19:10:40 5.10 -121. 194. 992. 298.
## 7 A1_01666 1 20160729 19:10:40 19:10:50 5.09 -121. 195. 992. 298.
## 8 A1_01667 1 20160729 19:10:50 19:11:00 5.08 -121. 194. 992. 298.
## 9 A1_01668 1 20160729 19:11:00 19:11:10 5.07 -121. 194. 992. 298.
## 10 A1_01669 1 20160729 19:11:10 19:11:20 5.06 -121. 194. 992. 298.
## # ℹ 11 more rows
## # ℹ 2 more variables: groundDst <dbl>, time <dbl>
## # A tibble: 21 × 12
## Index FlightNum Date Start End Lat Long Alt Press Temp
## <dbl> <dbl> <dbl> <time> <time> <dbl> <dbl> <dbl> <dbl> <dbl>
## 1 1660 1 20160729 19:09:40 19:09:50 0 0 0 992. 298.
## 2 1661 1 20160729 19:09:50 19:10:00 0 0 0 992. 298.
## 3 1662 1 20160729 19:10:00 19:10:10 0 0 0 992. 298.
## 4 1663 1 20160729 19:10:10 19:10:20 0 0 0 992. 298.
## 5 1664 1 20160729 19:10:20 19:10:30 0 0 0 992. 298.
## 6 1665 1 20160729 19:10:30 19:10:40 0 0 0 992. 298.
## 7 1666 1 20160729 19:10:40 19:10:50 0 0 0 992. 298.
## 8 1667 1 20160729 19:10:50 19:11:00 0 0 0 992. 298.
## 9 1668 1 20160729 19:11:00 19:11:10 0 0 0 992. 298.
## 10 1669 1 20160729 19:11:10 19:11:20 0 0 0 992. 298.
## # ℹ 11 more rows
## # ℹ 2 more variables: groundDst <dbl>, time <dbl>
3. Modeling Algorithms
1. Kalman filter
:It is one of the most popular estimation algorithm for
guidance, navigation and control vehicle of aircraft and ships. I
believe we, Sea Range, uses Kalman filter for data products for
smoothing or other purpose.
2. LSTM : Long short
term memory. It is part of neural network, and used for natural language
process using input, output and forget gates and which makes it remember
values over time interval.
3. Arima/Sarima : The
most well-known time series model.
4. Ridge Regressor with Forecasting
distance : Autopilot modeling  https://abouttrading.substack.com/p/lets-use-ridge-regression-to-predict
4. DataRobot (ADVANA)
Training and Instruction:https://wiki.advana.data.mil/display/SDKB/DataRobot
https://wiki.advana.data.mil/display/SDKB/DataRobot+Platform+Demo
DataRobot’s time series models employ a data format where the length of
the future prediction horizon is not predetermined. Instead, the number
of steps to forecast is calculated based on the number of rows
available, which directly relates to the size and configuration of the
backtesting groups. Consequently, if a 20-step forecast is required, a
dataset with only 1159 rows might not provide enough data to execute the
algorithm effectively, potentially leading to a prediction sequence of
only 10 steps. Blue(actual), Pink(ARIMA/SARIMA),Green(DataRobot),
Salmon(Kalman) and Purple(LSTM)
5. How to Use DataRobot with ADVANA
(Jupiter)
1. Create an ADVANA account.
2. Request access to the following through
ADVANA:
    Databricks
    EDL (External Data Load - https://edl.advana.data.mil/)
    Data Catalog (if your data is in ADVANA Catalog)
    DataRobot
3. Load your data into DataRobot using one of the
following methods:
    Option 1: Upload to EDL and then connect to
DataRobot.
    Option 2: Directly upload to DataRobot.
    Option 3: Locate data through the Data Catalog and connect
to DataRobot.
4. Connect DataRobot to Databricks
(Optional):
    Verify the correct cluster is running.
    Create a token within Databricks.
    Establish a connection to the Data Catalog (if
applicable).
Can I use DataRobot without importing models to
Databricks?
Yes, you can directly upload your data into DataRobot and allow
it to run automatically.
Why connect to Databricks?
For implementing forecasting models, combining series of models,
model comparison, and exporting outputs to other ADVANA
tools.
6.
COSMOS(SageMaker)
https://app.cosmos.navy.mil
https://wdibo.navy.mil/esc?id=kb_article&sysparm_article=KB0010300&table=kb_knowledge&searchTerm=Cosmos
What is COSMOS?
COSMOS is a self-service Amazon Web Services (AWS) gov cloud
broker designed by and for the Navy. Â Cosmos includes AWS
services and SageMaker which is an integrated
development environment (IDE) for ML with features like code writing,
debugging, visualization, and collaboration.