Skip to content

Commit cdd1c24

Browse files
[DOCS] Fixing formatting in Wind Turbine Anomaly (#1750)
Signed-off-by: Sebastian Golebiewski <sebastianx.golebiewski@intel.com>
1 parent f94a222 commit cdd1c24

File tree

1 file changed

+13
-4
lines changed
  • manufacturing-ai-suite/industrial-edge-insights-time-series/docs/user-guide/wind-turbine-anomaly-detection

1 file changed

+13
-4
lines changed

manufacturing-ai-suite/industrial-edge-insights-time-series/docs/user-guide/wind-turbine-anomaly-detection/how-to-select-model.md

Lines changed: 13 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -183,6 +183,7 @@ patch_sklearn()
183183
6. **GPU Efficiency**: Measure GPU memory usage and utilization
184184

185185
**GPU-Specific Tests**:
186+
186187
- Monitor VRAM consumption during streaming inference
187188
- Test CPU fallback behavior if GPU unavailable
188189
- Benchmark GPU vs CPU for single-point inference
@@ -195,6 +196,7 @@ patch_sklearn()
195196
**Training Data**: 10k-50k samples minimum, 6+ months coverage
196197

197198
**Preprocessing** (remove these points):
199+
198200
- Wind speed < 3 m/s or > 14 m/s (cut-in/cut-out)
199201
- Power < 50 kW when 3 < wind_speed < 14 (curtailment)
200202
- NaN/missing values
@@ -228,14 +230,16 @@ Your dataset should contain:
228230
### Example Dataset Formats
229231

230232
**Current reference dataset** (`T1.csv`):
231-
```csv
233+
234+
```text
232235
timestamp,grid_activepower,wind_speed,Theoretical_Power_Curve,Wind Direction (°)
233236
01 01 2018 00:00,380.05,5.31,416.33,259.99
234237
01 01 2018 00:10,453.77,5.67,519.92,268.64
235238
```
236239

237240
**Your dataset** - adapt column names:
238-
```csv
241+
242+
```text
239243
time,power_output,wind_speed_ms,temperature
240244
2024-01-01 00:00:00,385.2,5.3,15.2
241245
2024-01-01 00:10:00,448.1,5.6,15.4
@@ -266,7 +270,7 @@ def preprocess(df):
266270
'power_output': 'grid_activepower',
267271
'wind_speed_ms': 'wind_speed'
268272
})
269-
273+
270274
df = df.dropna()
271275
return df[
272276
(df['wind_speed'] >= 3) & (df['wind_speed'] <= 14) &
@@ -300,11 +304,13 @@ error_threshold = 0.15 # Adjust based on your data
300304
**5. Update simulation data** (optional):
301305

302306
Place your dataset in `simulation-data/`:
307+
303308
```bash
304309
cp your_dataset.csv simulation-data/wind-turbine-anomaly-detection.csv
305310
```
306311

307312
Update Telegraf config to read from your file:
313+
308314
```toml
309315
# telegraf-config/Telegraf.conf
310316
[[inputs.file]]
@@ -345,6 +351,7 @@ model = train_model(X, y)
345351
df_combined = df # Use all turbines together
346352
model = train_model(df_combined)
347353
```
354+
348355
---
349356

350357
## Testing Checklist
@@ -374,6 +381,7 @@ Before deploying:
374381

375382

376383
**Evaluation Script**:
384+
377385
```python
378386
from sklearn.metrics import mean_absolute_error, r2_score
379387
import time, numpy as np
@@ -383,7 +391,7 @@ def evaluate(model, X_test, y_test):
383391
y_pred = model.predict(X_test)
384392
print(f"MAE: {mean_absolute_error(y_test, y_pred):.2f} kW")
385393
print(f"R²: {r2_score(y_test, y_pred):.4f}")
386-
394+
387395
# Inference speed
388396
num_iters = 1000
389397
start = time.perf_counter()
@@ -452,6 +460,7 @@ def evaluate(model, X_test, y_test):
452460
- Controls feature sampling per split
453461

454462
**Tuning Template**:
463+
455464
```python
456465
from sklearn.model_selection import GridSearchCV
457466

0 commit comments

Comments
 (0)