Hybrid Approach:
- Integrated posts mid-series: combine early projects into cohesive pipelines
- Capstone posts at the end: full end-to-end solution
Each Project = one real security problem
Each Part = one blog post
Each blog post produces code, visuals, and insight
(Weeks 1–2)
“From Raw Logs to Analysis-Ready Data”
- Homebrew, Python (pyenv or uv), Jupyter, VS Code
- Virtual environments, Dataset folder structure
- Lists, dicts, comprehensions, reading logs, timestamps
- DataFrames, filtering, grouping, feature creation
- SQLite basics, SQL joins, aggregations
- JSON / NoSQL logs, flattening nested data
- Web scraping threat intelligence
- Reusable ingestion pipeline
- Blog-ready diagrams
- Clean datasets for later projects
(Weeks 3–4)
“Turning Uncertainty Into Signal”
- Mean, median, variance, outliers
- Visualizations: histograms, scatter plots, time series
- Conditional probability, false positives
- Bayes theorem, prior vs posterior
- Bayesian login risk engine
- FFT intuition, beacon detection, periodic traffic
- Bayesian threat scoring notebook
- Signal-based detector
- Visual SOC artifacts
(Weeks 5)
“From Raw Logs to ML-Ready Features”
- Combine Projects 1–2 into one cohesive pipeline
- Includes:
- Raw log ingestion
- Data cleaning & normalization
- Feature engineering
- Exploratory visualization
- Preliminary anomaly detection using K-Means
- Goal: Demonstrate a working pipeline for downstream ML
(Weeks 5–7)
“Finding Attacks Without Labels”
- Conceptual introduction
- No labels, unknown threats
- Feature engineering, cluster interpretation, dimensionality reduction
- Density-based clustering, anomaly detection
- Explainability, ensemble logic, trade-offs
- Threat hunting notebook
- Cluster-based anomaly detector
- SOC-ready visuals
(Weeks 8–9)
“Teaching Machines What ‘Bad’ Looks Like”
- Trend analysis, capacity planning
- Overfitting, evaluation metrics
- Dense layers, feature extraction, model training
- Precision, recall, confusion matrices
- Streaming data, live inference
- Phishing classifier
- Network protocol model
- Performance evaluation framework
(Weeks 10–11)
“Detection Without Signatures”
- Filters, feature maps, text/malware intuition
- Tokenization, embeddings, zero-day detection
- Reconstruction loss, training on normal data
- Reducing noise, improving detection
- Signature-less anomaly engine
- Deep learning portfolio artifacts
(Weeks 12–13)
“Solving the Problem You Actually Have”
- Reframing detection tasks, problem-solving mindset
- Multi-input models, graph thinking
- Evolutionary search, feature tuning
- Advanced CNN model
- Genetic optimization demo
(Weeks 14–16)
“From Student to Practitioner”
Goal: Demonstrate the entire end-to-end workflow, polished for portfolio and practical use.