Dataset Title: Multi-Asset Minute-Level Financial Dataset (May 2021 – April 2022) Overview: This dataset contains one-minute frequency financial data collected from 19 assets across five categories: - Stock Indices and Futures: NASDAQ Composite, S&P 500, DJIA, Nikkei 225, FTSE 100, E-mini Russell 2000, and futures (S&P 500 Futures, DJIA Futures, NASDAQ Futures) - Commodity Futures: Crude Oil/USD, Silver/USD, Gold/USD - Cryptocurrencies: BTC/USD, CMC Crypto 200 Index - Foreign Exchange: GBP/USD, EUR/USD, USD/JPY - Other Instruments: 10-Year U.S. Treasury Yield, Volatility Index (VIX) Time Span: May 16, 2021 – April 20, 2022 (Only market open days are included) Data Format: - Minute-level records - Each asset has 10-dimensional features per time step: [high/end, low/end, open/end, 1 - open/end, m5, m10, m15, m20, m25, m30] - open and end as the opening and closing prices - high and low denote the highest and lowest price - Moving average (mN) is calculated over N minutes and normalized by the current closing price. File Format: All files are in NumPy binary format (.npy) and can be loaded with: import numpy as np features = np.load('ftse100/features.npy') File Structure: - sp500/ ├── features.npy # Shape: (3636, 32, 190) ├── targets.npy # Shape: (3636, 24, 2) ├── targets_eval.npy # Shape: (3636, 24, 2) - ftse100/ ├── features.npy # Shape: (4848, 32, 190) ├── targets.npy # Shape: (4848, 24, 2) ├── targets_eval.npy # Shape: (4848, 24, 2) File Format: All files are in NumPy binary format (.npy) and can be loaded using: import numpy as np features = np.load('ftse100/features.npy') Description: 1. features.npy - Shape: (T, N, D) - T: Number of time steps (sliding windows) — 3636 for S&P 500, 4848 for FTSE 100 - N: Input sequence of length = 32 - D: Feature dimension = 190 (19 financial assets × 10 features each) 2. targets.npy - Shape: (T, H, 2) - H: Prediction sequence of length = 24 - Each label is a binary classification vector: - [1, 0] for downward movement - [0, 1] for upward movement 3. targets_eval.npy - Shape: (T, H, 2) - H: Prediction sequence of length = 24 - Each entry: - [negative_return_abs, 0] if return is negative - [0, positive_return] if return is positive - Only one of the two elements is non-zero per time step. - Used for calculating return-based performance (e.g., Sharpe ratio).