Trading Actors¶
Trading actors implement the policy interface for TorchTrade environments. Beyond standard neural network policies, TorchTrade provides rule-based strategies and LLM-powered agents.
Available Actors¶
| Actor | Type | Use Case |
|---|---|---|
| RuleBasedActor | Deterministic Strategy | Baselines, debugging, research benchmarks |
| MeanReversionActor | Rule-Based (Bollinger + Stoch RSI) | Ranging markets, baseline comparisons |
| FrontierLLMActor | LLM (API) | Research, rapid prototyping with GPT/Claude |
| LocalLLMActor | LLM (Local inference) | Production, privacy, cost efficiency |
RuleBasedActor¶
Abstract base class for deterministic trading strategies. It follows a two-phase pattern: preprocess (compute indicators on the full dataset up front), then decide (extract features and apply the rules at each step).
```python
from torchtrade.actor.rulebased.base import RuleBasedActor


class MyStrategy(RuleBasedActor):
    def get_preprocessing_fn(self):
        def preprocess(df):
            df["features_sma_20"] = df["close"].rolling(20).mean()
            df["features_rsi_14"] = compute_rsi(df["close"], 14)  # compute_rsi: user-supplied helper
            return df
        return preprocess

    def select_action(self, observation):
        data = self.extract_market_data(observation)
        price = self.get_feature(data, "close")[-1]  # latest close (assumes the raw "close" column is present)
        sma = self.get_feature(data, "features_sma_20")[-1]
        rsi = self.get_feature(data, "features_rsi_14")[-1]

        if rsi < 30 and price < sma:
            return 2  # Buy
        elif rsi > 70 and price > sma:
            return 0  # Sell
        return 1  # Hold
```
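In use, the preprocessing function is applied once to the full market DataFrame (typically when the dataset backing the environment is built), and select_action is then called at each step. A minimal sketch, assuming MyStrategy needs no constructor arguments and that the environment follows the reset pattern shown later on this page:

```python
# Sketch only -- the exact wiring may differ; see examples/rule_based/.
strategy = MyStrategy()

# Phase 1: compute indicators once over the whole dataset.
df = strategy.get_preprocessing_fn()(raw_ohlcv_df)  # raw_ohlcv_df: your OHLCV DataFrame

# Phase 2: apply the rules at each environment step.
observation = env.reset()  # env built from the preprocessed df
action = strategy.select_action(observation)  # 0 = sell, 1 = hold, 2 = buy
```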
MeanReversionActor¶
Concrete implementation using Bollinger Bands and Stochastic RSI. It buys when price is below the lower band with an oversold Stoch RSI, and sells when price is above the upper band with an overbought Stoch RSI.
| Parameter | Default | Description |
|---|---|---|
| `bb_window` | 20 | Bollinger Bands period |
| `bb_std` | 2.0 | Bollinger Bands standard deviations |
| `stoch_rsi_window` | 14 | Stochastic RSI period |
| `oversold_threshold` | 20.0 | Stoch RSI oversold level |
| `overbought_threshold` | 80.0 | Stoch RSI overbought level |
See examples/rule_based/ for offline and live usage examples.
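For instance, the built-in actor can be configured with the parameters above and queried per step in the same way (a minimal sketch, assuming MeanReversionActor is exported from torchtrade.actor like the LLM actors):

```python
# Sketch: configure the mean-reversion baseline explicitly.
# Import path and constructor signature are assumptions; see examples/rule_based/.
from torchtrade.actor import MeanReversionActor

actor = MeanReversionActor(
    bb_window=20,
    bb_std=2.0,
    stoch_rsi_window=14,
    oversold_threshold=20.0,
    overbought_threshold=80.0,
)

observation = env.reset()
action = actor.select_action(observation)  # 0 = sell, 1 = hold, 2 = buy
```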
FrontierLLMActor¶
LLM-based actor using frontier model APIs (OpenAI, Anthropic, etc.) for trading decisions. It constructs prompts from market data and account state, queries the LLM, and parses actions from structured `<think>...</think><answer>...</answer>` responses.
| Parameter | Default | Description |
|---|---|---|
| `model` | `"gpt-5-nano"` | Model identifier |
| `symbol` | `"BTC/USD"` | Trading symbol for prompt context |
| `action_dict` | `{"buy": 2, "sell": 0, "hold": 1}` | Action name to index mapping |
| `debug` | `False` | Print prompts and responses |
```python
from torchtrade.actor import FrontierLLMActor

actor = FrontierLLMActor(
    market_data_keys=env.market_data_keys,
    account_state=env.account_state,
    model="gpt-4-turbo",
)

observation = env.reset()
output = actor(observation)  # Returns tensordict with "action" and "thinking"
```
Requires OPENAI_API_KEY in .env. See examples/llm/frontier/ for offline and live examples.
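For reference, a structured response of the form `<think>reasoning</think><answer>buy</answer>` can be split with a small regex helper along these lines (an illustrative sketch, not the library's internal parser):

```python
import re

def parse_llm_response(text: str) -> tuple[str, str]:
    """Split a structured LLM reply into its reasoning and its final answer.

    Illustrative only -- FrontierLLMActor performs this parsing internally.
    """
    think = re.search(r"<think>(.*?)</think>", text, re.DOTALL)
    answer = re.search(r"<answer>(.*?)</answer>", text, re.DOTALL)
    thinking = think.group(1).strip() if think else ""
    action_name = answer.group(1).strip().lower() if answer else "hold"  # fall back to hold
    return thinking, action_name

thinking, action_name = parse_llm_response("<think>RSI is oversold.</think><answer>buy</answer>")
action = {"buy": 2, "sell": 0, "hold": 1}[action_name]  # default action_dict mapping
```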
LocalLLMActor¶
Local LLM-based actor using vLLM or transformers for inference. Same prompt interface as FrontierLLMActor but runs models locally.
| Parameter | Default | Description |
|---|---|---|
| `model` | `"Qwen/Qwen2.5-0.5B-Instruct"` | HuggingFace model ID |
| `backend` | `"vllm"` | `"vllm"` (faster, CUDA) or `"transformers"` (portable) |
| `quantization` | `None` | `None`, `"4bit"`, or `"8bit"` |
| `max_tokens` | 512 | Maximum tokens to generate |
| `temperature` | 0.7 | Sampling temperature |
| `action_space_type` | `"standard"` | `"standard"`, `"sltp"`, or `"futures_sltp"` |
```python
from torchtrade.actor import LocalLLMActor

actor = LocalLLMActor(
    model="Qwen/Qwen2.5-1.5B-Instruct",
    backend="vllm",
    quantization="4bit",
)

output = actor(observation)
```
For SLTP environments, pass action_space_type="sltp" and action_map=env.action_map. See examples/llm/local/ for offline and live examples.
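A minimal sketch of that SLTP configuration, assuming `env` is an SLTP environment that exposes `action_map`:

```python
from torchtrade.actor import LocalLLMActor

# Sketch: configure the local actor for a stop-loss/take-profit action space.
# Assumes `env` is an SLTP environment exposing `action_map`.
actor = LocalLLMActor(
    model="Qwen/Qwen2.5-1.5B-Instruct",
    backend="transformers",        # portable fallback when CUDA/vLLM is unavailable
    action_space_type="sltp",
    action_map=env.action_map,     # maps SLTP action names to environment indices
)

output = actor(env.reset())
```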
See Also¶
- Examples: LLM Actors - Full example scripts
- Examples: Rule-Based Actors - Mean reversion examples
- TorchRL Actors - Neural network policies