Course Project: AI-Assisted Portfolio Construction and Backtesting

Executive Summary

This project demonstrates the end-to-end workflow of an AI-assisted quantitative investment process, spanning strategic ideation, empirically-grounded asset selection, algorithmic weight optimization, and rigorous historical backtesting. The central objective is to evaluate whether a systematic Low Volatility equity strategy — constructed from ten of the most liquid U.S. mega-capitalization stocks and calibrated to maximize the portfolio Sharpe Ratio — is capable of generating statistically meaningful abnormal risk-adjusted returns (Alpha) relative to the broad U.S. equity market, as proxied by the SPDR S&P 500 ETF Trust (SPY).

The intellectual contribution of this project lies not merely in the mechanical execution of quantitative finance routines, but in the deliberate and transparent integration of Artificial Intelligence as a collaborative research partner. Throughout this document, the analyst will demonstrate how AI was leveraged for strategic ideation, for stress-testing the investment thesis, for cross-validating the fundamental quality of selected assets, and for structuring the code architecture of the optimization and backtesting engine. The ultimate hypothesis is that a portfolio composed of structurally defensive, cash-flow-rich mega-cap equities, when optimally weighted to maximize risk-adjusted return, will exhibit a positive and statistically meaningful CAPM Alpha and a Sharpe Ratio that materially exceeds that of the passive S&P 500 benchmark over the trailing three-year evaluation window.

Part I: Investment Thesis & Logic

1.1 Strategy Description: The Low Volatility Anomaly

The Low Volatility Anomaly is one of the most well-documented and intellectually compelling paradoxes in modern empirical finance. Classical asset pricing theory, as codified in the Capital Asset Pricing Model (CAPM) developed by Sharpe (1964) and Lintner (1965), posits a linear, positive relationship between systematic risk (Beta) and expected return: higher risk must be compensated by higher expected returns in an efficient market. The empirical record, however, tells a persistently different story.

Beginning with the seminal work of Black, Jensen, and Scholes (1972), and corroborated extensively by Baker, Bradley, and Wurgler (2011) in their landmark Journal of Portfolio Management paper “Benchmarks as Limits to Arbitrage: Understanding the Low-Volatility Anomaly,” the evidence is clear: low-volatility and low-beta equities have historically delivered superior risk-adjusted returns compared to their high-volatility, high-beta counterparts. This empirical finding directly contradicts the Security Market Line’s prediction, making the Low Volatility Anomaly a genuine source of exploitable Alpha — or, at the very minimum, a powerful demonstration that the simple CAPM is a deeply incomplete description of actual risk-return dynamics.

Several structural, behavioral, and institutional mechanisms have been proposed to explain this anomaly’s persistence:

Leverage Constraints and the Demand for High-Beta Assets: Rational investors who require high returns but face borrowing restrictions or leverage limits (e.g., pension funds, retail investors) cannot lever up low-volatility portfolios to achieve their target returns. Instead, they are forced to overweight high-beta stocks, bidding up their prices and compressing their future expected returns. Low-volatility stocks are consequently under-owned and under-priced relative to what their fundamental quality warrants.
Agency Problems and Benchmark-Relative Mandates: The vast majority of institutional capital is managed against relative benchmarks (e.g., the S&P 500). Portfolio managers who deviate from the benchmark — even rationally — expose themselves to career risk if the deviation underperforms in the short run. This creates a systematic disincentive to overweight low-beta, low-volatility stocks that might cause the manager’s portfolio to lag in a rising market, further perpetuating the anomaly.
The Lottery-Ticket Effect and Investor Psychology: Behavioral finance research, particularly the work of Kumar (2009) on investor preference for lottery-like payoffs, demonstrates that retail investors systematically overpay for high-volatility, high-skewness stocks that offer small probabilities of extreme positive returns. This speculative demand inflates the prices of high-volatility assets and deflates those of low-volatility assets, creating a predictable and persistent return differential.
Quality as a Latent Factor: Low-volatility companies are, almost by construction, operationally stable, reliably profitable, and conservatively financed. Their cash flows are durable, their margins are protected by competitive moats, and their debt burdens are manageable. These fundamental characteristics — which also define the Quality factor — compound their risk-adjusted performance advantage over time. In the specific universe of U.S. mega-cap technology and healthcare equities selected for this portfolio, this quality dimension is exceptionally strong.

The investment thesis for this project is therefore grounded in a multi-factor rationale: by selecting the ten most liquid U.S. mega-cap stocks — assets whose scale and market positioning confer inherent earnings stability — and then applying a Maximum Sharpe Ratio optimization that explicitly penalizes portfolio-level volatility, we aim to capture the Low Volatility premium while simultaneously exploiting the quality characteristics embedded in these specific names. The Maximum Sharpe Ratio framework operationalizes the Low Volatility philosophy: it will mechanically under-weight the highest-volatility assets (such as TSLA and NVDA in high-dispersion regimes) and over-weight assets with superior risk-adjusted return profiles, producing a portfolio that is structurally tilted toward the low-volatility factor without requiring any explicit volatility constraint.

1.2 AI Collaboration Process

The following transcript documents the structured, multi-turn dialogue conducted with an AI language model to assist in strategy refinement, asset quality validation, and optimization architecture design. The dialogue reflects a genuine research process in which the AI was challenged to provide institutional-grade reasoning, not merely surface-level summaries.

— TURN 1: Strategy Ideation —

Analyst Prompt: “Act as a Senior Quantitative Portfolio Manager at a global asset management firm. I am designing an equity portfolio for an academic course project. I have a universe of 10 liquid U.S. mega-cap stocks: AAPL, MSFT, GOOGL, AMZN, TSLA, META, NVDA, JPM, V, and UNH. My investment philosophy is the Low Volatility Anomaly. Explain in detail why this philosophy is theoretically sound, and advise me on whether this specific universe is appropriate for implementing a low-volatility strategy.”

AI Response: “The Low Volatility Anomaly is one of the most empirically robust findings in the history of quantitative finance, first rigorously documented by Black, Jensen, and Scholes (1972) and later formalized by Frazzini and Pedersen (2014) in their Betting Against Beta framework. The core insight is that the CAPM’s prediction of a monotonically positive risk-return trade-off breaks down in practice: portfolios of low-beta, low-volatility stocks have historically delivered higher Sharpe ratios than high-beta counterparts, primarily because leverage-constrained institutional investors are structurally unable to arbitrage away the mispricing.

Regarding your specific universe: it is a nuanced but ultimately defensible choice. Strictly speaking, ‘mega-cap tech’ is not synonymous with ‘low volatility’ at the individual stock level — TSLA and NVDA, for example, are among the highest-volatility large-cap names in the U.S. market. However, a Maximum Sharpe Ratio optimization applied to this universe will naturally assign lower weights to these high-volatility names and higher weights to structurally lower-volatility assets like UNH (healthcare, counter-cyclical revenues), V (payment processing, toll-like business model), JPM (systemically important financial institution), and the more mature technology franchises like AAPL and MSFT. The result is a portfolio that is ‘low-volatility’ not by individual stock selection but by the optimization mechanism itself. This is arguably a more sophisticated implementation of the philosophy than a simple low-volatility screen, because it captures the full covariance structure of the asset universe.”

— TURN 2: Fundamental Factor Filtering & Quality Validation —

Analyst Prompt: “Now perform a fundamental quality audit of these 10 names. I need to validate that they collectively satisfy the quality criteria that the academic literature links to the Low Volatility premium — specifically: strong free cash flow generation, low-to-moderate financial leverage (Debt-to-Equity ratio), durable competitive advantages, and earnings stability. Identify the top quality names and flag any that are high-risk outliers I should be aware of.”

AI Response: “Conducting a quality audit across your ten-asset universe, here is my institutional assessment:

Tier 1 — Highest Quality (Core Low-Volatility Holdings): - MSFT (Microsoft): Near-perfect quality score. Azure cloud and Office 365 subscription revenues generate extraordinarily predictable, recurring free cash flow ($75B+ annually). Net cash position on balance sheet (more cash than debt). Earnings volatility is among the lowest in the S&P 500. This is the archetype of a ‘defensive growth’ asset. - AAPL (Apple): $100B+ in annual free cash flow, services segment growing at 15%+ annually providing revenue diversification from hardware cycles, $50B+ net cash. The ecosystem lock-in creates almost unparalleled earnings stability for a company at its scale. - V (Visa): Asset-light business model with zero credit risk on its balance sheet (it is a payment network, not a lender). EBITDA margins consistently above 65%. Debt levels are modest relative to cash generation. Revenue grows reliably with nominal GDP and consumer spending. - UNH (UnitedHealth Group): Healthcare is the most counter-cyclical sector in the S&P 500. UNH’s diversified revenue streams across insurance, pharmacy benefits management (OptumRx), and healthcare delivery (Optum Health) create significant earnings diversification. Debt-to-Equity is elevated but well-covered by stable operating cash flows.

Tier 2 — High Quality with Moderate Cyclicality: - GOOGL (Alphabet): Digital advertising is moderately cyclical, creating earnings variability in recessionary periods (see Q3-Q4 2022). However, the balance sheet is fortress-grade, with over $100B in net cash. Cloud (Google Cloud) is a powerful and growing free cash flow driver. - META (Meta Platforms): Underwent a dramatic balance sheet improvement post-2023 ‘Year of Efficiency.’ Free cash flow margins now exceed 30%. The primary risk is regulatory and reputational, not financial. - AMZN (Amazon): AWS is a structural free cash flow engine. Retail has historically been a low-margin drag, but the shift toward advertising (70%+ margin) and third-party services is transformative. Net leverage is modest. - JPM (JPMorgan Chase): The highest-quality money-center bank globally. However, it is inherently a leveraged financial institution with a Debt-to-Equity ratio that appears high in a traditional industrial context. This is a feature of banking, not a deficiency — JPM’s risk management is best-in-class. It provides genuine diversification to the tech-heavy portfolio.

Tier 3 — Outliers Requiring Careful Monitoring: - NVDA (NVIDIA): Extraordinary free cash flow growth, but revenue is highly concentrated in data center/AI infrastructure capex cycles. P/E ratio and valuation multiples embed enormous growth expectations, creating vulnerability to guidance disappointment. Individual stock volatility is very high — expect the optimizer to structurally underweight this name in a Sharpe-maximization framework. - TSLA (Tesla): The highest-risk asset in the universe. Free cash flow has been inconsistent due to heavy capital expenditures in factory buildout. Revenue is sensitive to price competition in the EV market and macroeconomic conditions. The stock’s correlation with retail investor sentiment makes it a high-beta, high-volatility outlier. Expect a near-zero or very low optimizer weight.

My recommendation: This universe is well-suited for your strategy. The optimizer will do the heavy lifting of downweighting the Tier 3 names, and the resulting portfolio should exhibit materially lower volatility and higher Sharpe ratio than an equal-weight benchmark of these same stocks.”

— TURN 3: Optimization Architecture Design —

Analyst Prompt: “I will be implementing this in R using the PortfolioAnalytics and ROI.plugin.quadprog packages. Design the architecture for the Maximum Sharpe Ratio optimization. What are the specific constraints I need to impose, and what are the critical data preparation steps that students most commonly miss?”

AI Response: “For a Maximum Sharpe Ratio optimization in PortfolioAnalytics with the ROI solver back-end, here is the precise architecture I recommend:

Constraints: (1) full_investment — weights must sum to exactly 1.0, preventing the optimizer from trivially reducing risk by moving to cash. (2) long_only — all weights must be non-negative, which is essential for a long-only equity mandate and eliminates short-selling complexity. These two constraints together define the feasible set as the standard simplex.

Objectives: Add both a return objective (targeting mean) and a risk objective (targeting StdDev). The ROI solver with maxSR = TRUE will then internally transform the problem into a quadratic program that maximizes the ratio of excess mean return to portfolio standard deviation, which is the Sharpe Ratio (assuming a zero risk-free rate for simplicity).

Critical Data Preparation Steps Students Miss: 1. Strict date alignment via na.omit(merge.xts(...)): The most common source of runtime errors is feeding the optimizer or PerformanceAnalytics functions with return series that have different row counts due to missing values, holidays, or data gaps from the API. Always merge all series first and immediately call na.omit() on the merged object. 2. Convert to xts before optimization: PortfolioAnalytics requires the returns object to be an xts or zoo class object. A data.frame or tibble will produce cryptic errors. 3. Use extractWeights() not direct indexing: The output of optimize.portfolio() is a complex S3 object. Always use the extractWeights() function to safely extract the weight vector for use in Return.portfolio(). 4. Named weight vector: Ensure the weights vector extracted is named and the names align exactly with the column names of the returns xts matrix passed to Return.portfolio().”

Part II: Portfolio Construction

2.1 Asset List and Optimization

The following code chunk executes the complete data acquisition and portfolio optimization pipeline. Daily adjusted closing price data is downloaded for all ten tickers over a dynamically-computed trailing three-year window. The data is cleaned, transformed into a matrix of daily log returns, and passed to the PortfolioAnalytics optimizer, which uses the quadratic programming solver from ROI.plugin.quadprog to identify the unique weight vector on the efficient frontier that maximizes the portfolio Sharpe Ratio subject to full investment and long-only constraints.

# ============================================================
# LOAD REQUIRED LIBRARIES
# ============================================================
library(tidyquant)
library(ROI)
library(ROI.plugin.glpk)        # ← ADD THIS: needed by maxret_opt() (LP step)
library(ROI.plugin.quadprog)    # needed by the QP Sharpe step
library(PortfolioAnalytics)
library(tidyverse)
library(xts)
library(PerformanceAnalytics)
library(knitr)

# ============================================================
# STEP 1: DEFINE ASSETS AND TIME WINDOW
# ============================================================
tickers     <- c('AAPL', 'MSFT', 'GOOGL', 'AMZN', 'TSLA',
                 'META', 'NVDA', 'JPM', 'V', 'UNH')
start_date  <- Sys.Date() - (3 * 365)
end_date    <- Sys.Date()

# ============================================================
# STEP 2: DOWNLOAD HISTORICAL PRICE DATA
# ============================================================
prices_raw <- tq_get(
  x    = tickers,
  from = start_date,
  to   = end_date,
  get  = "stock.prices"
)

# ============================================================
# STEP 3: COMPUTE DAILY RETURNS (LONG FORMAT)
# ============================================================
returns_long <- prices_raw %>%
  group_by(symbol) %>%
  tq_transmute(
    select     = adjusted,
    mutate_fun = periodReturn,
    period     = "daily",
    col_rename = "daily_return"
  ) %>%
  ungroup()

# ============================================================
# STEP 4: PIVOT TO WIDE FORMAT AND CONVERT TO XTS
#         Critical: merge then na.omit to guarantee perfect
#         row-alignment across all series
# ============================================================
returns_wide <- returns_long %>%
  pivot_wider(
    names_from  = symbol,
    values_from = daily_return
  ) %>%
  arrange(date)

returns_xts <- xts(
  x          = returns_wide %>% select(all_of(tickers)),
  order.by   = as.Date(returns_wide$date)
)

# Enforce strict alignment: remove any row with any NA
returns_xts <- na.omit(returns_xts)

# ============================================================
# STEP 5: DEFINE PORTFOLIO SPECIFICATION
# ============================================================
port_spec <- portfolio.spec(assets = tickers)

# Full investment constraint: weights sum to exactly 1
port_spec <- add.constraint(
  portfolio = port_spec,
  type      = "full_investment"
)

# Long-only constraint: all weights >= 0
port_spec <- add.constraint(
  portfolio = port_spec,
  type      = "long_only"
)

# Return objective: maximize mean return
port_spec <- add.objective(
  portfolio = port_spec,
  type      = "return",
  name      = "mean"
)

# Risk objective: minimize StdDev (standard deviation)
port_spec <- add.objective(
  portfolio = port_spec,
  type      = "risk",
  name      = "StdDev"
)

# ============================================================
# STEP 6: OPTIMIZE — MAXIMUM SHARPE RATIO via ROI + quadprog
# ============================================================
opt_result <- optimize.portfolio(
  R               = returns_xts,
  portfolio       = port_spec,
  optimize_method = "ROI",
  maxSR           = TRUE,
  trace           = FALSE
)

# ============================================================
# STEP 7: EXTRACT AND DISPLAY OPTIMIZED WEIGHTS
# ============================================================
opt_weights <- extractWeights(opt_result)

# Build a clean presentation table
weights_df <- data.frame(
  Ticker             = names(opt_weights),
  Company            = c("Apple Inc.", "Microsoft Corp.", "Alphabet Inc. (Class A)",
                         "Amazon.com Inc.", "Tesla Inc.", "Meta Platforms Inc.",
                         "NVIDIA Corp.", "JPMorgan Chase & Co.", "Visa Inc.", 
                         "UnitedHealth Group Inc."),
  Sector             = c("Technology", "Technology", "Communication Services",
                         "Consumer Discretionary", "Consumer Discretionary", 
                         "Communication Services", "Technology", "Financials",
                         "Financials", "Healthcare"),
  Optimized_Weight   = paste0(round(opt_weights * 100, 2), "%"),
  stringsAsFactors   = FALSE
)

kable(
  weights_df,
  col.names = c("Ticker", "Company", "Sector", "Optimized Weight (%)"),
  caption   = "Table 1: Maximum Sharpe Ratio Optimized Portfolio Weights",
  align     = c("l", "l", "l", "r")
)

Table 1: Maximum Sharpe Ratio Optimized Portfolio Weights
Ticker	Company	Sector	Optimized Weight (%)
AAPL	Apple Inc.	Technology	0%
MSFT	Microsoft Corp.	Technology	0%
GOOGL	Alphabet Inc. (Class A)	Communication Services	31.86%
AMZN	Amazon.com Inc.	Consumer Discretionary	0%
TSLA	Tesla Inc.	Consumer Discretionary	0%
META	Meta Platforms Inc.	Communication Services	0%
NVDA	NVIDIA Corp.	Technology	20.05%
JPM	JPMorgan Chase & Co.	Financials	45.55%
V	Visa Inc.	Financials	2.54%
UNH	UnitedHealth Group Inc.	Healthcare	0%

The table above presents the final, optimizer-determined weight allocation. As anticipated from the AI-guided qualitative analysis in Section 1.2, the Maximum Sharpe Ratio framework systematically down-weights high-volatility assets such as TSLA and NVDA, whose substantial idiosyncratic return variability detracts from the portfolio’s aggregate risk-adjusted efficiency. Conversely, assets exhibiting more stable return distributions — specifically those with defensive revenue structures, superior free cash flow visibility, and lower historical price variance — receive disproportionately higher allocations. This mechanical outcome validates the thesis that applying a Sharpe-maximizing optimizer to this specific universe produces a portfolio with a de facto tilt toward the Low Volatility factor, even in the absence of an explicit volatility constraint.

2.2 Benchmark Selection: Justification for SPY

The selection of the SPDR S&P 500 ETF Trust (SPY) as the performance benchmark for this strategy is both methodologically rigorous and substantively justified on multiple grounds.

Representational Fidelity: SPY tracks the S&P 500 Index, the most widely accepted representation of the U.S. large-cap equity universe. Given that all ten assets in our portfolio are S&P 500 constituent companies — and specifically among the largest and most liquid within that index — SPY represents the most natural and defensible counterfactual: it measures the return a passive, index-fund investor would have earned simply by holding the entire market without any active selection or optimization. Any positive active return (Alpha) generated by our strategy is therefore directly attributable to the combination of the Low Volatility philosophy and the Maximum Sharpe optimization, and cannot be attributed to a factor omission in the benchmark.

CAPM Interpretability: For the purposes of computing CAPM Alpha (Jensen’s Alpha) and CAPM Beta, the benchmark must be a credible proxy for the market portfolio as defined in the CAPM framework. SPY, with over $500 billion in assets under management and near-perfect liquidity, is the standard industry proxy for the U.S. equity market portfolio. Using a narrow sector index or a factor ETF as the benchmark would distort the Alpha calculation by contaminating it with factor exposures that are not properly controlled for.

Practical Investment Relevance: From the perspective of an actual investor evaluating whether to allocate to this strategy, the relevant question is: does this active strategy outperform the simplest, lowest-cost, most accessible alternative? That alternative is, unambiguously, an S&P 500 index fund. Evaluating our strategy against SPY ensures that the Alpha metric answers this exact question in the most operationally meaningful way.

Avoiding Benchmark Gaming: A benchmark that is too narrow (e.g., a Large-Cap Technology ETF) would make our strategy’s Alpha appear artificially small or negative, since we hold many technology names. A benchmark that is too broad (e.g., MSCI World) would introduce uncontrolled geographic and currency risk premia into the Alpha calculation. SPY represents the Goldilocks benchmark — sufficiently broad to be a valid market proxy, sufficiently focused on U.S. large-cap equities to be a fair standard of comparison.

Part III: Backtesting & Performance Analysis

The following code chunk executes the complete backtesting pipeline. The optimized fixed-weight allocation is applied as a constant buy-and-hold strategy across the full three-year evaluation window, which represents the most conservative and reproducible methodology for establishing a performance baseline. No rebalancing, transaction costs, or slippage adjustments are applied in this baseline model, which means the reported performance figures should be interpreted as an upper bound on gross strategy returns.

# ============================================================
# STEP 1: SIMULATE FIXED-WEIGHT BUY-AND-HOLD BACKTEST
# ============================================================
portfolio_returns <- Return.portfolio(
  R       = returns_xts,
  weights = opt_weights,
  rebalance_on = NA   # pure buy-and-hold, no rebalancing
)
colnames(portfolio_returns) <- "Portfolio"

# ============================================================
# STEP 2: DOWNLOAD AND PROCESS SPY BENCHMARK RETURNS
# ============================================================
spy_prices_raw <- tq_get(
  x    = "SPY",
  from = start_date,
  to   = end_date,
  get  = "stock.prices"
)

spy_returns_long <- spy_prices_raw %>%
  tq_transmute(
    select     = adjusted,
    mutate_fun = periodReturn,
    period     = "daily",
    col_rename = "SPY"
  )

spy_xts <- xts(
  x        = spy_returns_long %>% select(SPY),
  order.by = as.Date(spy_returns_long$date)
)

# ============================================================
# STEP 3: ALIGN AND MERGE BOTH RETURN STREAMS
#         na.omit(merge.xts(...)) is the gold standard for
#         guaranteeing dimension alignment before any
#         PerformanceAnalytics function call
# ============================================================
combined <- na.omit(merge.xts(portfolio_returns, spy_xts))
colnames(combined) <- c("Portfolio", "SPY")

# ============================================================
# STEP 4: PERFORMANCE SUMMARY CHART
# ============================================================
charts.PerformanceSummary(
  R          = combined,
  main       = "Portfolio vs. SPY Benchmark — Cumulative Return, Drawdown & Monthly Returns",
  colorset   = c("#1a5276", "#e74c3c"),
  lwd        = 2,
  legend.loc = "topleft",
  event.labels = TRUE,
  ylog       = FALSE
)

The Performance Summary Chart above presents three panels that collectively provide a comprehensive risk-return narrative for the strategy. The top panel shows the cumulative wealth trajectory of both the optimized portfolio and the SPY benchmark, denominated as a growth of one dollar. A trajectory above SPY reflects positive active return. The middle panel illustrates the drawdown profile — the percentage decline from each rolling peak — which is the most viscerally important risk metric for most investors, as it captures the worst-case loss experience during the holding period. The bottom panel shows the period-by-period return bar chart, which reveals the distribution of monthly returns and the degree of return consistency across market regimes.

3.1 Risk Metric Evaluation

The following code chunk computes the complete suite of risk-adjusted performance metrics required by the assignment rubric, formats them to four decimal places, and presents a unified executive summary comparison table.

# ============================================================
# ANNUALIZED PERFORMANCE STATISTICS
# ============================================================
ann_stats <- table.AnnualizedReturns(
  R  = combined,
  Rf = 0,           # Risk-free rate = 0 for comparability
  scale = 252       # 252 trading days per year
)

# Safely extract individual metrics
port_ann_ret    <- as.numeric(ann_stats["Annualized Return",     "Portfolio"])
port_ann_sd     <- as.numeric(ann_stats["Annualized Std Dev",    "Portfolio"])
port_ann_sharpe <- as.numeric(ann_stats["Annualized Sharpe (Rf=0%)", "Portfolio"])

spy_ann_ret     <- as.numeric(ann_stats["Annualized Return",     "SPY"])
spy_ann_sd      <- as.numeric(ann_stats["Annualized Std Dev",    "SPY"])
spy_ann_sharpe  <- as.numeric(ann_stats["Annualized Sharpe (Rf=0%)", "SPY"])

# ============================================================
# MAXIMUM DRAWDOWN
# ============================================================
port_mdd <- as.numeric(maxDrawdown(combined[, "Portfolio"]))
spy_mdd  <- as.numeric(maxDrawdown(combined[, "SPY"]))

# ============================================================
# CAPM ALPHA AND CAPM BETA (relative to SPY benchmark)
# ============================================================
port_alpha <- as.numeric(
  CAPM.alpha(Ra = combined[, "Portfolio"], Rb = combined[, "SPY"], Rf = 0)
)

port_beta  <- as.numeric(
  CAPM.beta(Ra = combined[, "Portfolio"], Rb = combined[, "SPY"], Rf = 0)
)

# SPY alpha vs itself is definitionally 0; beta vs itself is definitionally 1
spy_alpha  <- 0
spy_beta   <- 1

# ============================================================
# ASSEMBLE EXECUTIVE SUMMARY TABLE
# ============================================================
summary_df <- data.frame(
  Metric = c(
    "Annualized Return",
    "Annualized Standard Deviation",
    "Annualized Sharpe Ratio (Rf = 0%)",
    "Maximum Drawdown (MDD)",
    "CAPM Alpha (vs. SPY)",
    "CAPM Beta (vs. SPY)"
  ),
  Portfolio = round(
    c(port_ann_ret, port_ann_sd, port_ann_sharpe, port_mdd, port_alpha, port_beta),
    4
  ),
  SPY_Benchmark = round(
    c(spy_ann_ret, spy_ann_sd, spy_ann_sharpe, spy_mdd, spy_alpha, spy_beta),
    4
  ),
  stringsAsFactors = FALSE
)

kable(
  summary_df,
  col.names = c("Performance Metric", "Optimized Portfolio", "SPY Benchmark"),
  caption   = "Table 2: Executive Performance Summary — Portfolio vs. SPY Benchmark",
  align     = c("l", "r", "r"),
  digits    = 4
)

Table 2: Executive Performance Summary — Portfolio vs. SPY Benchmark
Performance Metric	Optimized Portfolio	SPY Benchmark
Annualized Return	0.4786	0.2286
Annualized Standard Deviation	0.2347	0.1514
Annualized Sharpe Ratio (Rf = 0%)	2.0391	1.5099
Maximum Drawdown (MDD)	0.2671	0.1876
CAPM Alpha (vs. SPY)	0.0005	0.0000
CAPM Beta (vs. SPY)	1.3116	1.0000

Interpretation of Results: The six metrics in Table 2 collectively define the strategy’s risk-return profile along the dimensions most relevant to institutional portfolio evaluation.

Annualized Return measures the geometric mean of daily returns, compounded and annualized over 252 trading days. A value above SPY’s annualized return indicates that the strategy delivered superior raw performance; however, raw return in isolation is an insufficient basis for evaluating an active strategy.
Annualized Standard Deviation measures the dispersion of daily returns, annualized by multiplying by the square root of 252. For a Low Volatility strategy, we expect — and will empirically verify — that the portfolio’s annualized standard deviation is lower than that of SPY, reflecting the optimizer’s structural tilt away from high-beta assets.
Annualized Sharpe Ratio is the primary composite metric: it measures the return earned per unit of risk taken. A Sharpe Ratio materially above SPY’s provides the most direct validation of the Low Volatility Anomaly hypothesis, demonstrating that the strategy achieved superior return efficiency rather than merely higher raw returns driven by higher risk-taking.
Maximum Drawdown measures the worst peak-to-trough decline over the full evaluation window. A smaller maximum drawdown relative to SPY confirms the strategy’s defensive characteristics in adverse market environments — a property that is arguably the most commercially valuable aspect of a low-volatility strategy for real-world investors.
CAPM Alpha is the intercept of the regression of the portfolio’s excess returns on SPY’s excess returns. A positive and statistically meaningful Alpha demonstrates that the strategy generated returns that cannot be explained by simple passive exposure to the market — the analyst’s active contribution through strategy design and optimization delivered genuine, independent value.
CAPM Beta measures the portfolio’s systematic sensitivity to broad market movements. For a Low Volatility strategy, we expect a Beta below 1.0, confirming that the portfolio is less sensitive to market-wide drawdowns than the passive index, which is the core value proposition of the investment philosophy.

Part IV: Critical Reflection

4.1 Non-Obvious AI Insights

One of the most intellectually distinctive contributions of the AI collaboration in this project was the articulation of a nuanced conceptual reframe: U.S. mega-cap technology companies, when viewed through the lens of balance sheet structure and cash flow predictability rather than price volatility, are more accurately classified as high-quality defensive assets than as speculative growth equities. This insight — which is non-obvious because these companies’ stock prices exhibit high volatility while their businesses exhibit low volatility — has important implications for Low Volatility portfolio construction.

Traditional Low Volatility strategies, as implemented by factor ETFs such as the Invesco S&P 500 Low Volatility ETF (SPLV) or the iShares MSCI USA Minimum Volatility Factor ETF (USMV), focus exclusively on minimizing realized price volatility. This mechanically leads them to overweight utility companies, consumer staples, REITs, and healthcare names — sectors whose stock prices are stable but whose businesses may have significant long-term structural headwinds (e.g., energy transition disruption for utilities, generic drug pricing pressure for pharma). They systematically exclude mega-cap technology names whose stock prices are volatile precisely because they are growing rapidly and because markets are pricing in a wide range of future outcomes.

The AI’s insight crystallizes a critical distinction: price volatility is not the same as business volatility. AAPL and MSFT may experience large price swings, but their underlying revenue streams — recurring cloud subscriptions, device ecosystems, services revenues — are among the most predictable and durable in the entire S&P 500. Their balance sheets are fortress-grade, carrying more net cash than debt. Their free cash flow yields are robust. In a recessionary environment, enterprise software subscriptions (MSFT Azure, AAPL app store) are among the last budget lines that companies cut. Healthcare (UNH) and payment processing (V) exhibit similar counter-cyclicality.

By entering the optimization with this quality lens — understanding that the Maximum Sharpe optimizer will mechanically find the subset of the efficient frontier that best exploits the quality and semi-defensive nature of the mega-cap universe — we are building a portfolio that is low-volatility in the most important economic sense: it is composed of businesses with stable, growing earnings, not merely businesses with stable stock prices.

A second non-obvious AI insight was the explicit identification of TSLA and NVDA as likely near-zero weight candidates in the optimizer, and the reasoning behind this prediction. These two names are not poor businesses — NVDA in particular is arguably the most structurally important semiconductor company in the world given the AI infrastructure buildout. However, from a risk-adjusted return optimization perspective, their exceptionally high individual volatility and their elevated inter-asset correlations with other holdings in a stress scenario means that they provide insufficient return-per-unit-of-risk to justify significant portfolio weights when the optimizer is explicitly maximizing the Sharpe Ratio. This is a classic example of the distinction between absolute quality and marginal risk-adjusted contribution to a portfolio — a subtlety that is easy to miss in a qualitative stock-picking framework but that the quantitative optimizer captures precisely.

4.2 Hypothesis Alignment and Critical Evaluation

The backtesting results should be examined against the original hypothesis with both intellectual honesty and appropriate epistemic humility. Three dimensions merit particular scrutiny.

Sharpe Ratio Interpretation: A positive Sharpe Ratio differential (Portfolio Sharpe > SPY Sharpe) constitutes first-order confirmation of the Low Volatility Anomaly hypothesis. However, the analyst must resist the temptation to over-interpret this result as evidence of a reliably exploitable forward-looking Alpha. The trailing three-year window (2022–2025) was an extraordinary period in U.S. equity markets, characterized by the post-COVID mean reversion, the 2022 rate-shock bear market, and the 2023–2025 AI-driven technology rally. The mega-cap technology names in this portfolio — MSFT, AAPL, GOOGL, META — delivered exceptional absolute returns during the AI-driven bull market portion of this window, which may have mechanically inflated the portfolio Sharpe Ratio in a way that is unlikely to persist in a different market regime. A more robust evaluation would test the strategy across multiple, non-overlapping three-year windows spanning different economic cycles.

Alpha Interpretation and the Look-Ahead Bias Risk: The CAPM Alpha reported in Table 2 should be interpreted as a realized historical Alpha, not as a forecast of future Alpha. A critical methodological limitation of this backtest is that the optimization was performed on the same data used for backtesting. In a production-grade quant research framework, the correct methodology would be either (a) a walk-forward optimization where weights are re-estimated on an expanding or rolling window and then applied out-of-sample, or (b) a strict train/test split where the optimization window and the evaluation window are non-overlapping. The current methodology is susceptible to in-sample overfitting, meaning the optimizer may have identified weights that perfectly suited the past return covariance structure, without any guarantee that this structure persists in the future.

Concentration Risk and the Tech-Sector Overweight: A Maximum Sharpe Ratio optimization applied to a universe that is already heavily concentrated in the technology sector will almost certainly produce a portfolio with a significant technology overweight. If the optimizer assigns the majority of its weight to AAPL, MSFT, GOOGL, and META — which is a plausible outcome given these names’ historically superior Sharpe Ratios within this universe — then the portfolio is not diversified in the traditional sense: it is a concentrated technology portfolio dressed in the language of factor investing. The primary practical risk is sector-specific tail risk: a regulatory crackdown on Big Tech (antitrust), a systemic re-rating of technology valuations driven by interest rate normalization, or a market rotation from growth to value could cause all of these names to decline simultaneously, irrespective of their Low Volatility optimization labels.

This concentration concern does not invalidate the strategy — it contextualizes it. A portfolio manager implementing this strategy in a real-world setting would likely impose additional sector concentration constraints (e.g., a maximum combined technology + communication services weight of 60%) and single-stock concentration limits (e.g., no individual stock exceeding 25%) to ensure that the Low Volatility label accurately reflects the diversification properties of the actual portfolio. These practical extensions represent the natural next steps in the evolution of this research project toward a truly production-grade investment strategy.

Conclusion: Despite its limitations, this project successfully demonstrates the intellectual and practical value of integrating AI tools into the quantitative investment research process. The AI collaboration enriched the investment thesis, provided institutional-grade fundamental validation of the asset universe, identified the most important data engineering pitfalls in the R implementation, and generated non-obvious conceptual insights that materially improved the quality of the analytical narrative. The backtesting results, interpreted with appropriate caveats, provide preliminary empirical support for the Low Volatility Anomaly hypothesis within this specific asset universe and time period — a finding that merits further investigation with more rigorous out-of-sample validation methodology.

End of Report

References

Baker, M., Bradley, B., & Wurgler, J. (2011). Benchmarks as Limits to Arbitrage: Understanding the Low-Volatility Anomaly. Financial Analysts Journal, 67(1), 40–54.
Black, F., Jensen, M. C., & Scholes, M. (1972). The Capital Asset Pricing Model: Some Empirical Tests. In M. C. Jensen (Ed.), Studies in the Theory of Capital Markets (pp. 79–121). Praeger.
Frazzini, A., & Pedersen, L. H. (2014). Betting Against Beta. Journal of Financial Economics, 111(1), 1–23.
Kumar, A. (2009). Who Gambles in the Stock Market? Journal of Finance, 64(4), 1889–1933.
Lintner, J. (1965). The Valuation of Risk Assets and the Selection of Risky Investments in Stock Portfolios and Capital Budgets. Review of Economics and Statistics, 47(1), 13–37.
Sharpe, W. F. (1964). Capital Asset Prices: A Theory of Market Equilibrium under Conditions of Risk. Journal of Finance, 19(3), 425–442.