cuFOLIO Skill
<!--
SPDX-FileCopyrightText: Copyright (c) 2023-2025 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
SPDX-License-Identifier: Apache-2.0
-->
Purpose
Build and analyze quantitative portfolios with NVIDIA-accelerated Mean-CVaR optimization. Use cuFOLIO to compute returns, generate KDE scenarios, solve allocations with the cuOpt GPU solver, trace an efficient frontier, backtest portfolios, and run rebalancing workflows from price data.
When to Use
Use this skill when the task is to:
- Build or optimize a Mean-CVaR portfolio from stock prices.
- Allocate weights across tickers while controlling downside CVaR risk.
- Plot or inspect an efficient frontier for a portfolio universe.
- Produce a weights-by-risk-aversion table.
- Backtest an optimized portfolio against benchmarks.
- Rebalance a portfolio on a schedule or drift trigger.
- Run workflows on an S&P 500, S&P 100, Dow 30, or user-supplied price dataset.
Common trigger phrases include "optimize my portfolio", "build a CVaR portfolio", "use cuFOLIO on these tickers", "solve with cuOpt", "plot the efficient frontier", "show weights by risk aversion", "backtest this allocation", "rebalance monthly", "analyze my holdings with CVaR", "compare allocations", "reduce downside risk", "construct an allocation", "assess allocation options", "stress-test my holdings", "evaluate downside-risk exposure", "review my holdings under weight caps", "compare benchmark portfolios", "simulate CVaR scenarios", "screen portfolio risk", "optimize holdings under constraints", and "find a lower-risk allocation".
Do not use it for generic finance summaries, price forecasting, neural-network training, vehicle routing, or non-portfolio optimization.
Prerequisites
- Python environment with the installed package.
- NVIDIA GPU runtime with cuOpt and cuML installed.
- CUDA extra matching the host, such as or .
- exposing .
- Network access on first run if the default price CSV must be downloaded.
Setup
This skill drives the installed
package. A ready environment can come from the Brev launchable or from
NVIDIA-AI-Blueprints/cuFOLIO
after installing the matching CUDA extra.
In packaged agent/eval sandboxes,
may be available through
rather than as a separately published wheel. Verify the local package with
python -c "import cufolio"
before declaring it missing. Do not
, do not reimplement cuFOLIO workflows from scratch, and do not replace the package APIs with generic pandas/scipy/cvxpy portfolio code.
For concrete implementation details, use
references/workflows/agent_recipes.md
as the source of truth. It contains exact working shapes for loading prices, preparing returns, solving with cuOpt, building a 25-point frontier, backtesting against equal weight, and calling the rebalancer.
The default dataset is
data/stock_data/sp500.csv
. It is gitignored. Before a first-run download, tell the user this fetches public market data through the cuFOLIO/yfinance data helper and ask them to confirm:
python
import cvxpy as cp
from cufolio.cvar_parameters import CvarParameters
from cufolio.utils import download_data
download_data("data/stock_data", datasets=["sp500"])
SOLVER_SETTINGS = {"solver": cp.CUOPT, "verbose": False, "solver_method": "PDLP"}
cvar_params = CvarParameters(
w_min=0.0, w_max=1.0,
c_min=0.0, c_max=0.0,
risk_aversion=1.0, confidence=0.95,
)
Instructions
Briefly state the defaults being applied before execution, then use these guardrails:
- Load
data/stock_data/sp500.csv
; if it is missing, ask before downloading with cufolio.utils.download_data
. Do not glob, substitute, or fabricate price data.
- Validate user CSVs before solving: require a date-like index or first date column, numeric ticker columns, at least 60 rows after date filtering, and at least one requested ticker. If the user gives start/end dates, slice the price DataFrame before returns computation and report the retained date range. Filter tickers on the price DataFrame before returns are computed. does not take a ticker field.
- Compute LOG returns with
utils.calculate_returns(...)
.
- Generate scenarios with
cvar_utils.generate_cvar_data(...)
, KDE, and KDESettings(device="GPU")
.
- Define with explicit and . For ordinary "build the optimal portfolio" requests, set and so the result is fully invested instead of 100% cash.
- Build
cvar_optimizer.CVaR(returns_dict, cvar_params)
directly from that returns dictionary; keep tickers, scenario arrays, means, and covariance in the shapes returned by cuFOLIO helpers.
- Solve with NVIDIA cuOpt only. Before solving, verify and
str(cp.CUOPT) in {str(s) for s in cp.installed_solvers()}
. Pass to every single-shot solve or looped frontier solve. Never fall back to CLARABEL, SCS, ECOS, or another CPU solver. If cuOpt is absent, finish validation/setup and report that the GPU/cuOpt runtime is missing instead of fabricating a CPU result.
- For custom constraints, map user requests to : weight caps to /, risk appetite to , confidence level to , cash allowance to , and cardinality only when the package exposes an explicit asset-count constraint for the workflow. If constraints conflict (for example, a max weight too low to invest across the requested ticker count), explain the conflict and ask for the constraint to relax instead of guessing.
- If the user omits a benchmark for backtesting, use an equal-weight portfolio over the same tickers. If the user omits a constraint, keep the defaults table values and briefly restate consequential assumptions before solving.
- Deliver weights sorted by allocation, cash weight, expected return, CVaR, solver label (), and any requested frontier figure, weights table, backtest metrics, or rebalancing schedule. For tables, include tickers as columns or rows with decimal weights and percentages; for plots, preserve the returned cuFOLIO figure instead of redrawing from scratch.
- For report-grade answers, include evidence that the requested workflow actually ran. For an efficient frontier, state and use the requested (25 unless the user specifies otherwise). For a weights table, expand into ticker columns and include plus . For a backtest, include , , , and for both optimized and benchmark portfolios. For rebalancing, include , , and the tail of
cumulative_portfolio_value
.
Canonical Workflow Skeleton
Start positive cuFOLIO tasks from this shape and adapt only the requested output. For complete copyable functions, read
references/workflows/agent_recipes.md
before writing custom code.
python
import cvxpy as cp
import pandas as pd
from cufolio import backtest, cvar_optimizer, cvar_utils, rebalance, utils
from cufolio.cvar_parameters import CvarParameters
from cufolio.portfolio import Portfolio
from cufolio.settings import KDESettings, ReturnsComputeSettings, ScenarioGenerationSettings
if not hasattr(cp, "CUOPT") or str(cp.CUOPT) not in {str(s) for s in cp.installed_solvers()}:
raise RuntimeError("cuOpt GPU solver is required; do not substitute a CPU solver.")
SOLVER_SETTINGS = {"solver": cp.CUOPT, "verbose": False, "solver_method": "PDLP"}
prices = utils.get_input_data("data/stock_data/sp500.csv")
returns_dict = utils.calculate_returns(
prices,
regime_dict=None,
returns_compute_settings=ReturnsComputeSettings(return_type="LOG"),
)
returns_dict = cvar_utils.generate_cvar_data(
returns_dict,
ScenarioGenerationSettings(
fit_type="kde",
kde_settings=KDESettings(device="GPU"),
),
)
cvar_params = CvarParameters(
w_min=0.0,
w_max=1.0,
c_min=0.0,
c_max=0.0,
risk_aversion=1.0,
confidence=0.95,
)
optimizer = cvar_optimizer.CVaR(returns_dict, cvar_params)
result, optimal_portfolio = optimizer.solve_optimization_problem(
solver_settings=SOLVER_SETTINGS,
print_results=False,
)
For an efficient frontier or weights table, call:
python
results_df, fig, ax = cvar_utils.create_efficient_frontier(
returns_dict,
cvar_params,
SOLVER_SETTINGS,
ra_num=25,
show_plot=False,
show_discretized_portfolios=False,
benchmark_portfolios=False,
print_portfolio_results=False,
)
weights_table = pd.DataFrame(results_df["weights"].tolist(), index=results_df.index)
For a benchmark backtest, wrap the solved allocation in
Portfolio(name="cuOpt Optimal", tickers=returns_dict["tickers"], weights=optimal_portfolio.weights, cash=optimal_portfolio.cash)
, create an equal-weight
over the same
, then use
backtest.portfolio_backtester(..., test_method="historical").backtest_against_benchmarks(...)
. The backtester returns
.
For monthly rebalancing, write the price DataFrame to a CSV path first. Instantiate
rebalance.rebalance_portfolio(dataset_directory=<csv_path>, ...)
with
re_optimize_criteria={"type": "drift_from_optimal", "threshold": 0, "norm": 1}
and call
re_optimize(transaction_cost_factor=..., plot_title="Monthly Rebalancing")
. The rebalancer returns
(results_dataframe, re_optimize_dates, cumulative_portfolio_value)
.
Data and Defaults
| Setting | Default |
|---|
| Dataset | data/stock_data/sp500.csv
|
| Date range | Full available range |
| Portfolio type | Long-only |
| Max weight | None unless specified |
| Risk aversion | |
| Confidence | |
| Scenario method | KDE on GPU |
| Solver | cuOpt GPU with PDLP |
| Rebalancing | None unless requested |
The default S&P 500 file is a historical snapshot and can omit current constituents. User-supplied CSVs should be date-indexed price tables with ticker columns, compatible with
. If requested tickers are absent, drop them, report the omissions, and continue with available columns unless the user explicitly asks you to fetch other data.
Key APIs
Use the package APIs instead of reimplementing portfolio math or simulation loops. cuFOLIO helpers return flat objects:
has keys such as
,
,
, and
; do not index it as
.
solve_optimization_problem(...)
returns
, not a nested result dictionary.
- Returns:
utils.calculate_returns(input_dataset, regime_dict, returns_compute_settings)
.
- Regime filter: is or
{"name": "...", "range": ("YYYY-MM-DD", "YYYY-MM-DD")}
; it is not keyed by regime name and does not contain tickers.
- Scenarios:
cvar_utils.generate_cvar_data(returns_dict, scenario_generation_settings)
.
- Optimizer:
cvar_optimizer.CVaR(returns_dict, cvar_params)
.
- Solve:
result_row, portfolio = cvar_problem.solve_optimization_problem(solver_settings=SOLVER_SETTINGS, print_results=False)
.
- Efficient frontier:
cvar_utils.create_efficient_frontier(returns_dict, cvar_params, solver_settings=SOLVER_SETTINGS, ra_num=25)
. The returned includes metrics, a dict column, and .
- Portfolio:
Portfolio(name="", tickers=None, weights=None, cash=0.0, time_range=None)
; pass tickers and a flat array-like aligned to those tickers.
- Backtest: create objects for the optimized allocation and each benchmark; for an equal-weight benchmark, use weights of and , then call
backtest.portfolio_backtester(test_portfolio, returns_dict, risk_free_rate=0.0, test_method="historical", benchmark_portfolios=[...]).backtest_against_benchmarks(...)
.
- Rebalance:
rebalance.rebalance_portfolio(...)
requires to be a CSV path, not a DataFrame. Call ; it returns (results_dataframe, re_optimize_dates, cumulative_portfolio_value)
.
- Settings models: ,
ScenarioGenerationSettings
, , , and .
Examples
- "Build the optimal portfolio from the S&P 500": load prices, compute LOG returns, generate GPU KDE scenarios, set long-only fully invested , solve with cuOpt, and report diversified weights plus return/CVaR.
- "Plot the efficient frontier": call
create_efficient_frontier(...)
, return , and show or save the figure as requested.
- "Give me weights by risk aversion": expand into a per-asset table.
- "Backtest against equal weight": build the optimized and equal-weight objects, then use the cuFOLIO backtester and report Sharpe, Sortino, and max drawdown.
- "Backtest monthly rebalancing": configure with the drift trigger above and run
re_optimize(transaction_cost_factor=...)
.
Limitations
- Requires an NVIDIA GPU with cuOpt and cuML; CPU solvers are intentionally disallowed.
- CPU-only eval containers can still validate routing, data handling, and reporting behavior, but they cannot produce a valid cuOpt solve. In that case, report the missing GPU/cuOpt runtime explicitly.
- Default price data is a historical snapshot and may omit current constituents.
- First-run dataset download depends on network access unless the user supplies a CSV.
Troubleshooting
- Missing default CSV or : explain that cuFOLIO will fetch public market data with
download_data("data/stock_data", datasets=["sp500"])
; run it only after user confirmation.
- or missing : install the CUDA extra matching the host and verify with
python -c "import cvxpy as cp; print(hasattr(cp, 'CUOPT'), cp.installed_solvers())"
.
- for or GPU KDE failures: confirm cuML is present with and keep
KDESettings(device="GPU")
.
- Ordinary optimization returns all cash: set in .
- Solver reports infeasible or no solution: check for contradictory bounds, too few tickers for the requested caps/cardinality, or a date filter that leaves too little data; report the smallest constraint change that would make the request feasible.
- Requested tickers are absent from the default CSV: report them and proceed with the remaining requested tickers.
- User CSV fails validation: ask for a date-indexed price table or a CSV whose first column is dates and remaining columns are numeric ticker prices; mention the minimum 60-row post-filter requirement.