Building a Modern Cybersecurity Monitoring & Response System
A Practical Architecture Using Wazuh, SOAR, and Threat Intelligence
Why most security projects fail before they start
Many organizations want “better security”, but what they usually get is:
- Too many alerts, no action
- Expensive tools nobody understands
- Security dashboards that look good but don’t protect anything
- A system that depends on a few individuals’ knowledge
The real problem is not tools.
It’s system design.
This article explains how we design a production-ready cybersecurity monitoring & response system—one that is practical, auditable, and automatable—and why this architecture works in the real world.
The real objective (not marketing buzzwords)
The goal is not “installing SIEM” or “using AI”.
The real objectives are:
- Detect real threats, not noise
- Know who must respond, and when
- React fast before damage spreads
- Keep evidence for audit & investigation
- Stay flexible and vendor-neutral
This is a system engineering problem, not a product selection problem.
The architecture philosophy
We deliberately separate responsibilities.
Detection ≠ Automation ≠ Escalation ≠ Investigation
Each part must do one job extremely well.
The core stack we use (and why)
System architecture overview
```mermaid
graph TD
    A["Endpoints / Servers / Cloud"] --> B["Wazuh Agent"]
    B --> C["Wazuh Manager (SIEM/XDR)"]
    C --> D["Shuffle SOAR"]
    D -->|Create / Update Incident| E["DFIRTrack"]
    D -->|SEV-1 / SEV-2| F["PagerDuty"]
    D -->|Automated Response| G["Firewall / DNS / IAM / EDR"]
    F --> H["On-call Engineer"]
```
This diagram shows how detection, automation, escalation, and investigation are cleanly separated but tightly integrated.
1. Detection layer — Wazuh
Wazuh acts as the security sensor network:
- Collects logs from:
  - Firewall
  - DNS
  - IDS / IPS
  - VPN
  - Servers & endpoints
- Normalizes events
- Applies correlation rules
Wazuh answers:
“Something suspicious happened. What is it?”
We do not overload Wazuh with business logic.
Its job is detection, not decision-making.
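To make that boundary concrete, here is a minimal Python sketch of the hand-off from detection to automation. It assumes alerts arrive as JSON shaped like Wazuh's alert output (`rule.id`, `rule.level`, `agent.name`, `data.srcip`). The sample alert and the `to_handoff` helper are illustrative and not part of Wazuh. The function only extracts fields and makes no decision.

```python
import json

# Illustrative alert, loosely modeled on the JSON Wazuh writes for each alert.
SAMPLE_ALERT = json.dumps({
    "timestamp": "2024-01-01T00:00:00.000+0000",
    "rule": {"id": "5715", "level": 3, "description": "sshd: authentication success."},
    "agent": {"id": "001", "name": "web-01"},
    "data": {"srcip": "203.0.113.9"},
})


def to_handoff(raw_alert: str) -> dict:
    """Extract the fields the automation layer needs. No scoring, no blocking."""
    alert = json.loads(raw_alert)
    rule = alert.get("rule", {})
    return {
        "rule_id": rule.get("id"),
        "rule_level": rule.get("level"),
        "description": rule.get("description"),
        "agent": alert.get("agent", {}).get("name"),
        "src_ip": alert.get("data", {}).get("srcip"),  # None if the log had no source IP
        "timestamp": alert.get("timestamp"),
    }
```

Keeping this step free of thresholds means severity rules can change in the playbook without touching the detection layer.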
2. Automation & decision layer — SOAR (Shuffle)
Once something is detected, we need logic:
- Is this serious?
- Is this known malicious?
- Should we block, alert, or just log?
This is where SOAR comes in.
Shuffle allows us to build explicit security playbooks:
- Threat-intelligence enrichment
- Severity calculation
- Conditional response
- System-to-system orchestration
Shuffle answers:
“What should we do next?”
This is where engineering experience matters most.
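As a rough illustration of severity calculation and conditional response, the sketch below scores an enriched alert and maps the result to actions. The weights, thresholds, field names, and action names are all assumptions chosen for the example. In practice this logic would live in a Shuffle workflow rather than a standalone script. The only external fact used is that Wazuh rule levels run from 0 to 15.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EnrichedAlert:
    rule_level: int        # Wazuh rule level, 0-15
    ioc_confidence: float  # 0.0-1.0 from threat-intel enrichment (hypothetical field)
    asset_critical: bool   # whether the affected asset is business-critical


def severity(alert: EnrichedAlert) -> str:
    """Combine detection level, intel confidence, and asset criticality into a SEV label."""
    score = alert.rule_level / 15 * 50 + alert.ioc_confidence * 30
    if alert.asset_critical:
        score += 20
    if score >= 75:
        return "SEV-1"
    if score >= 50:
        return "SEV-2"
    if score >= 25:
        return "SEV-3"
    return "SEV-4"


def next_actions(sev: str) -> list[str]:
    """Answer 'block, alert, or just log?' with an explicit, reviewable table."""
    table = {
        "SEV-1": ["create_incident", "page_on_call", "block"],
        "SEV-2": ["create_incident", "page_on_call"],
        "SEV-3": ["create_incident"],
        "SEV-4": ["log"],
    }
    return table[sev]
```

Writing the decision as a lookup table keeps the playbook auditable: a reviewer can see every possible outcome without tracing branches.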
3. Guaranteed human response — PagerDuty
Automation is powerful—but humans are still responsible.
PagerDuty ensures:
- The right person is notified
- Escalation happens if no one responds
- Response time is measurable (SLA)
PagerDuty answers:
“Who is responsible right now?”
This is the difference between alerts and accountability.
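For illustration, this is roughly what a page for a SEV-1 or SEV-2 incident could look like as a PagerDuty Events API v2 trigger event. The sketch only builds the request body. Sending it would be an HTTP POST to `https://events.pagerduty.com/v2/enqueue`. The SEV-to-severity mapping, the `dedup_key` format, and the function name are assumptions for this example.

```python
# Map internal SEV labels to the severities the Events API v2 accepts
# ("critical", "error", "warning", "info"). Only SEV-1 / SEV-2 page a human.
SEVERITY_MAP = {"SEV-1": "critical", "SEV-2": "error"}


def build_trigger_event(routing_key: str, incident_id: str,
                        summary: str, source: str, sev: str) -> dict:
    """Build an Events API v2 'trigger' body for a high-severity incident."""
    if sev not in SEVERITY_MAP:
        raise ValueError(f"{sev} does not page the on-call engineer")
    return {
        "routing_key": routing_key,  # integration key of the PagerDuty service
        "event_action": "trigger",
        # Reusing the same key for repeat alerts groups them into one PagerDuty incident.
        "dedup_key": f"incident-{incident_id}",
        "payload": {
            "summary": summary,
            "source": source,
            "severity": SEVERITY_MAP[sev],
            "custom_details": {"internal_severity": sev},
        },
    }
```

Tying `dedup_key` to the incident ID is one way to keep a noisy detection from paging the same engineer repeatedly for a single incident.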
4. Investigation & audit trail — DFIRTrack
Every serious event becomes an incident:
- Evidence
- Timeline
- Decisions
- Actions taken
DFIRTrack provides:
- Incident records
- Asset tracking
- Investigation notes
- Audit readiness
DFIRTrack answers:
“What happened, exactly?”
This is essential for compliance, post-incident review, and trust.
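The shape of such a record can be sketched as an append-only timeline. This is a generic illustration of the evidence / decision / action structure described above, not DFIRTrack's actual data model or API. All class and field names are hypothetical.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
from typing import Optional

ENTRY_KINDS = {"evidence", "decision", "action"}


@dataclass(frozen=True)
class TimelineEntry:
    at: datetime
    kind: str    # one of ENTRY_KINDS
    detail: str
    actor: str   # person or system that produced the entry


@dataclass
class Incident:
    title: str
    severity: str
    timeline: list[TimelineEntry] = field(default_factory=list)

    def record(self, kind: str, detail: str, actor: str,
               at: Optional[datetime] = None) -> None:
        """Append an entry. Existing entries are never edited or removed."""
        if kind not in ENTRY_KINDS:
            raise ValueError(f"unknown entry kind: {kind}")
        when = at or datetime.now(timezone.utc)
        self.timeline.append(TimelineEntry(when, kind, detail, actor))
```

Recording the actor on every entry, including automated ones, is what lets a later review separate what the playbook did from what a person decided.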
How real use cases are implemented
Example 1: DNS communication to malicious domains
Problem
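Much malware relies on DNS to “phone home”: it resolves a domain name before contacting its command-and-control server, so DNS logs are one of the earliest places an infection becomes visible.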
System behavior
- DNS logs are collected
- Domain is compared against live threat-intelligence feeds
- If malicious:
  - Incident is created
  - Endpoint is identified
  - Firewall/DNS block is applied
  - On-call engineer is notified (if severity is high)
This can catch an infection at the command-and-control stage, often before any data is stolen.
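The comparison step can be sketched as follows, assuming the threat-intelligence feed has already been loaded into a set of malicious domains. The function name and the sample blocklist are illustrative. The sketch also checks parent domains, so a query for `cdn.evil.example` matches an indicator for `evil.example`.

```python
from typing import Optional


def matching_ioc(queried_domain: str, blocklist: set[str]) -> Optional[str]:
    """Return the blocklisted domain that the query falls under, or None."""
    labels = queried_domain.lower().rstrip(".").split(".")
    # Walk from the full name up to the registrable part, skipping the bare TLD:
    # a.b.evil.example -> b.evil.example -> evil.example
    for i in range(len(labels) - 1):
        candidate = ".".join(labels[i:])
        if candidate in blocklist:
            return candidate
    return None


# Hypothetical feed contents, using reserved example domains.
BLOCKLIST = {"evil.example", "c2.bad.test"}
```

Returning the matched indicator, rather than a plain boolean, gives the incident record the exact entry that triggered the block.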
Example 2: IDS / IPS traffic to known attacker IPs
Problem
Some attacks bypass endpoint security.
System behavior
- IDS/IPS logs are analyzed
- Destination IP matches known attacker infrastructure
- Correlated with asset criticality
- Automated containment or escalation
This avoids “signature spam” and focuses on real risk.
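A minimal sketch of this triage, using Python's standard `ipaddress` module. The attacker networks, the asset inventory, and the policy are assumptions made for the example. The policy auto-contains low-criticality assets and escalates everything else to a human. It reflects one cautious choice: avoid automatically isolating a production server.

```python
import ipaddress

# Hypothetical threat-intel entries (documentation address ranges).
ATTACKER_NETWORKS = [
    ipaddress.ip_network("203.0.113.0/24"),
    ipaddress.ip_network("198.51.100.7/32"),
]

# Hypothetical asset inventory: internal IP -> criticality.
ASSET_CRITICALITY = {"10.0.0.5": "high", "10.0.20.31": "low"}


def triage(src_ip: str, dest_ip: str) -> str:
    """Return 'ignore', 'contain', or 'escalate' for one IDS/IPS event."""
    dest = ipaddress.ip_address(dest_ip)
    if not any(dest in net for net in ATTACKER_NETWORKS):
        return "ignore"  # signature fired, but destination is not known-bad
    if ASSET_CRITICALITY.get(src_ip, "unknown") == "low":
        return "contain"  # safe to isolate automatically
    return "escalate"     # critical or unknown asset: a human decides
```

Treating unknown assets like critical ones is deliberate in this sketch: a gap in the inventory should produce a page, not a silent automated action.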
Example 3: VPN login success from outside Thailand
Problem
A successful login can still be an attack.
System behavior
- VPN authentication logs analyzed
- GeoIP enrichment applied
- If the login succeeds from an unexpected country:
  - Risk score increases
  - Incident is created
  - Optional forced verification or temporary block
This detects credential theft, not just brute force.
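The scoring can be sketched like this, assuming the event has already been enriched with a GeoIP country code. The event field names, the score weights, and the thresholds are illustrative. A real deployment would tune them and would likely keep a per-user list of expected countries.

```python
EXPECTED_COUNTRIES = {"TH"}  # ISO 3166-1 alpha-2 codes considered normal


def assess_vpn_login(event: dict) -> dict:
    """Score one VPN authentication event and pick a response."""
    risk = 0
    if event["success"] and event["country"] not in EXPECTED_COUNTRIES:
        risk += 60  # valid credentials used from an unexpected location
    if event.get("failed_attempts_last_hour", 0) >= 5:
        risk += 30  # recent guessing activity against the same account

    if risk >= 90:
        action = "temporary_block"
    elif risk >= 60:
        action = "force_verification"
    else:
        action = "none"
    return {"risk": risk, "create_incident": risk >= 60, "action": action}
```

The successful foreign login alone crosses the incident threshold, while failed attempts alone do not. That ordering matches the point of the example: a login that works is the more dangerous signal.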
Why threat intelligence must always be fresh
Attackers rotate:
- Domains
- IPs
- Infrastructure
That’s why we build:
- Scheduled IOC updates
- Confidence scoring
- Expiration handling
- Automated enforcement updates
Security systems that rely on static rules become obsolete fast.
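Confidence scoring and expiration handling can be sketched as a filter over the indicator store. The `Indicator` fields, the 70-point confidence threshold, and the per-indicator TTL are assumptions for illustration. Only indicators that are both recent and trusted end up in the enforcement set pushed to firewalls or DNS.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Iterable


@dataclass(frozen=True)
class Indicator:
    value: str           # domain or IP
    confidence: int      # 0-100, as reported or computed from the feed
    last_seen: datetime  # last time any feed reported it
    ttl: timedelta       # how long it stays valid without being re-reported

    def is_active(self, now: datetime) -> bool:
        return now < self.last_seen + self.ttl


def enforceable(indicators: Iterable[Indicator], now: datetime,
                min_confidence: int = 70) -> set[str]:
    """Values that should currently be blocked: not expired and trusted enough."""
    return {
        i.value for i in indicators
        if i.is_active(now) and i.confidence >= min_confidence
    }
```

Running this on every scheduled feed update and pushing the resulting set means stale entries drop out of enforcement on their own, instead of accumulating as permanent block rules.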
Why this architecture scales (and survives audits)
- Modular
- Vendor-neutral
- Open-source friendly
- Easy to extend
- Clear responsibility boundaries
This system works for:
- SMEs
- Factories
- Enterprises
- Managed security services (MDR)
Why clients hire us to build this
Because we don’t:
- Install tools and disappear
- Sell dashboards without response
- Hide logic inside black boxes
We design systems:
- With clear intent
- With documented logic
- With measurable outcomes
Security is not about tools.
It’s about decisions, timing, and responsibility.
If you are planning a similar system
If you’re thinking:
- “We want better security visibility”
- “We need real incident response, not alerts”
- “We want something we can understand and control”
Then this architecture is a strong foundation.
If you want it designed and implemented correctly,
with real operational experience behind it—
Let’s talk.
Final thought
Good security systems don’t feel complicated.
They feel calm, predictable, and under control.
That’s the system we build.