๋ฐ˜์‘ํ˜•

๋Œ€๊ทœ๋ชจ ์ œ์กฐ ๋ฐ์ดํ„ฐ ๋ถ„์„ ๋ฐ ๋ณด๊ณ ์„œ ์ƒ์„ฑ์„ ์œ„ํ•œ Agentic AI ์‹œ์Šคํ…œ์—์„œ
LLM ์ปดํ“จํŒ… ๋น„์šฉ๊ณผ ์‘๋‹ต ์ง€์—ฐ(latency) ์€ ์ฃผ์š” ๋ฌธ์ œ์ด๋‹ค.

๋‹ค์Œ ์‚ฌํ•ญ์„ ํฌํ•จํ•˜์—ฌ ํšจ์œจํ™” ์ „๋žต์„ ์ œ์‹œํ•˜์‹œ์˜ค.

 

1. ๋ชจ๋ธ ์„œ๋น™ ๋ฐ ์บ์‹ฑ ์ „๋žต (vLLM, Triton, TensorRT ๋“ฑ)

2. ํ† ํฐ ๋‹จ์œ„ ์ตœ์ ํ™” (Prompt/Response Caching, Prefix Tuning ๋“ฑ)

3. ๋ชจ๋ธ ์••์ถ• ๋ฐ ๋ถ„์‚ฐ ์„œ๋น™ ์ „๋žต (Quantization, Sharding, Mixture-of-Experts ๋“ฑ)

 

 

โ‘  ๋ฌธ์ œ ์ธ์‹

  • ์ œ์กฐ ํ˜„์žฅ์€ ๋Œ€๋Ÿ‰ ๋ณด๊ณ ์„œ(์ˆ˜์ฒœ๊ฑด/์ผ) ์ƒ์„ฑ ์š”๊ตฌ → LLM ํ˜ธ์ถœ ๋น„์šฉ·์ง€์—ฐ์ด ๊ธ‰์ฆ.
  • ๋”ฐ๋ผ์„œ LLM์˜ ์ปดํ“จํŒ… ํšจ์œจํ™”(Serving + Token + Storage) ๊ฐ€ ํ•ต์‹ฌ์ด๋‹ค.

โ‘ก ๋ชจ๋ธ ์„œ๋น™ ์ตœ์ ํ™”

์ „๋žต๊ธฐ์ˆ ์„ค๋ช…
vLLM Continuous batching + PagedAttention ์—ฌ๋Ÿฌ ์š”์ฒญ์„ ํ•œ ๋ฒˆ์— ์ฒ˜๋ฆฌํ•˜์—ฌ GPU ํ™œ์šฉ๋ฅ  ๊ทน๋Œ€ํ™”
Triton Server Multi-model serving LLM + ML๋ชจ๋ธ + RAG ์ธํผ๋Ÿฐ์Šค ํ†ตํ•ฉ ์„œ๋น™
TensorRT-LLM FP8 quant + graph fusion GPU inference latency 30~40% ๋‹จ์ถ•
Async Queue Redis + asyncio ๋™์‹œ ์š”์ฒญ์„ ๋น„๋™๊ธฐ๋กœ ํ์ž‰

์˜ˆ์‹œ ๊ตฌ์กฐ:

 
Client → API Gateway → vLLM → Cache → Report Agent

โ‘ข ํ† ํฐ ํšจ์œจํ™” ์ „๋žต

๋ฐฉ๋ฒ•์„ค๋ช…๊ธฐ๋Œ€ํšจ๊ณผ
Prompt Caching ๋™์ผ ์งˆ์˜ ํ”„๋กฌํ”„ํŠธ ํ•ด์‹œ ์ €์žฅ ๋ฐ˜๋ณต ๋ณด๊ณ ์„œ ์žฌ์‚ฌ์šฉ
Prefix Tuning ๊ณต์ •๋ณ„ ํŠนํ™” prefix๋งŒ ๋ฏธ์„ธ์กฐ์ • ํŒŒ๋ผ๋ฏธํ„ฐ ์ˆ˜ ๊ฐ์†Œ(0.3~1%)
Response Caching “query+context hash” ์บ์‹œ ํ‚ค๋กœ ์ €์žฅ RAG ๋ฐ˜๋ณต ํ˜ธ์ถœ ๊ฐ์†Œ
Streaming Output ์ฆ‰์‹œ ์‘๋‹ต ์ŠคํŠธ๋ฆผ ์ „๋‹ฌ UX ๊ฐœ์„ , ์ง€์—ฐ ์ฒด๊ฐ ๊ฐ์†Œ

โ‘ฃ ๋ชจ๋ธ ์••์ถ• ๋ฐ ๋ถ„์‚ฐ ์„œ๋น™

๊ธฐ์ˆ ๋‚ด์šฉ์žฅ์ 
Quantization (4bit/8bit) ์ •๋ฐ€๋„ ๋‚ฎ์ถฐ ๋ฉ”๋ชจ๋ฆฌ ์ ˆ์•ฝ 70% GPU VRAM ์ ˆ๊ฐ
Sharding / ZeRO ๋Œ€ํ˜• ๋ชจ๋ธ์„ GPU๊ฐ„ ๋ถ„ํ•  ๋Œ€๊ทœ๋ชจ LLM ์„œ๋น™ ๊ฐ€๋Šฅ
MoE (Mixture of Experts) ์š”์ฒญ๋ณ„๋กœ ์ผ๋ถ€ ์ „๋ฌธ๊ฐ€ ๋ ˆ์ด์–ด๋งŒ ํ™œ์„ฑ ํ‰๊ท  ์—ฐ์‚ฐ๋Ÿ‰ 20~40% ๊ฐ์†Œ

โ‘ค ์‹ค๋ฌด ์‹œ๋‚˜๋ฆฌ์˜ค

  • 13B ๋ชจ๋ธ(vLLM) 3๊ฐœ → GPU 4์žฅ(48GB)
  • PromptCache ํ™œ์„ฑํ™” → ๋ฐ˜๋ณต ์งˆ์˜ ์‘๋‹ต ์†๋„ 3๋ฐฐ ๊ฐœ์„ 
  • FP8 TensorRT ๋ณ€ํ™˜ → ๋‹จ์ผ ๋ณด๊ณ ์„œ ์‘๋‹ต์‹œ๊ฐ„ 9.8s → 4.3s
  • ๋น„์šฉ ์ ˆ๊ฐ: GPU ์‚ฌ์šฉ๋ฅ  35% ↓, ์›” $3,000 ์ ˆ์•ฝ

โ‘ฅ ํ‰๊ฐ€ ํฌ์ธํŠธ

  • ๋ชจ๋ธ ์„œ๋น™ ๊ตฌ์กฐ(vLLM/Triton)์™€ ํ† ํฐ ์ตœ์ ํ™”๋ฅผ ๊ตฌ์ฒด์ ์œผ๋กœ ์–ธ๊ธ‰ํ–ˆ๋Š”๊ฐ€
  • Quantization/MoE ๊ฐ™์€ ์ปดํ“จํŒ… ์ ˆ๊ฐ ๊ธฐ์ˆ ์˜ ์›๋ฆฌ๋ฅผ ์„ค๋ช…ํ–ˆ๋Š”๊ฐ€
  • ์‹ค์ œ ์šด์˜ ํšจ๊ณผ(์†๋„·๋น„์šฉ ๊ฐœ์„ )๋ฅผ ์ˆ˜์น˜๋กœ ์ œ์‹œํ–ˆ๋Š”๊ฐ€
๋ฐ˜์‘ํ˜•

 

๋ฐ˜์‘ํ˜•

 

์ œ์กฐ๊ณต์ •์˜ ๋ถˆ๋Ÿ‰ ์›์ธ ๋ถ„์„(Anomaly Root Cause Analysis)์„ ์ž๋™ํ™”ํ•˜๊ธฐ ์œ„ํ•ด
Reasoning Chain ๊ธฐ๋ฐ˜ Agentic AI ๊ตฌ์กฐ๋ฅผ ์„ค๊ณ„ํ•˜๋ ค ํ•œ๋‹ค.

๋ฐ์ดํ„ฐ๋ฅผ ํ†ตํ•œ “์ด์ƒ ํƒ์ง€ → ์›์ธ ์ถ”๋ก  → ๊ทผ๊ฑฐ ๋ฌธํ—Œ ์ธ์šฉ → ์กฐ์น˜ ์ œ์•ˆ” ๊ณผ์ •์„
Agent Chain ํ˜•ํƒœ๋กœ ๊ตฌ์„ฑํ•˜๊ณ , ๊ฐ ๋‹จ๊ณ„์˜ ์ž…๋ ฅ·์ถœ๋ ฅ ๊ตฌ์กฐ๋ฅผ ์„ค๊ณ„ํ•˜์‹œ์˜ค.

 

 

 

โ‘  ๋ชฉ์ 

  • ์ œ์กฐ ๋ถˆ๋Ÿ‰์˜ ์›์ธ์€ ๋‹จ์ผ ๋ณ€์ˆ˜๊ฐ€ ์•„๋‹Œ ๋‹ค์ˆ˜์˜ ์ƒํ˜ธ์ž‘์šฉ ๋ณ€์ˆ˜์— ์˜ํ•ด ๋ฐœ์ƒ.
  • LLM์ด ๋ฐ์ดํ„ฐ๋ฅผ ๊ทผ๊ฑฐ๋กœ ๋…ผ๋ฆฌ์  Reasoning Chain์„ ๋”ฐ๋ผ๊ฐ€๋ฉฐ
    ์›์ธ์„ ์„ค๋ช…ํ•˜๊ณ  ๊ทผ๊ฑฐ ๋ฌธ์„œ๋ฅผ ์ธ์šฉํ•ด์•ผ ํ•จ.

โ‘ก ์ „์ฒด ๊ตฌ์กฐ

[Sensor Data] โ”€โ”€> Anomaly-Agent
                    ↓
                RootCause-Agent
                    ↓
                RAG-Agent
                    ↓
                Action-Agent
                    ↓
                Report-Agent

โ‘ข ๋‹จ๊ณ„๋ณ„ ์—ญํ• 

Agent์ž…๋ ฅ์ฒ˜๋ฆฌ ๋กœ์ง์ถœ๋ ฅ
Anomaly-Agent ์‹œ๊ณ„์—ด ๋ฐ์ดํ„ฐ Isolation Forest / TCN ์ด์ƒ ๊ตฌ๊ฐ„ (time range, variables)
RootCause-Agent ์ด์ƒ ๊ตฌ๊ฐ„ ๋ฐ์ดํ„ฐ ์ƒ๊ด€๋ถ„์„, SHAP, Causal Inference ์ฃผ์š” ๋ณ€์ˆ˜·์˜ํ–ฅ๋„
RAG-Agent ๋ณ€์ˆ˜๋ช…, ๊ณต์ •๋ช… ๋ฌธํ—Œ·SOP ๊ฒ€์ƒ‰ ๊ด€๋ จ ์ ˆ์ฐจ/ํ—ˆ์šฉ๋ฒ”์œ„
Action-Agent ์›์ธ+SOP ๋‚ด์šฉ ์กฐ์น˜ ์ œ์•ˆ ์ƒ์„ฑ ์กฐ์น˜ ํ…์ŠคํŠธ
Report-Agent ๋ชจ๋“  ๊ฒฐ๊ณผ ๋ฆฌํฌํŠธ ํ†ตํ•ฉ PDF/DOCX ๋ณด๊ณ ์„œ

โ‘ฃ ์˜ˆ์‹œ ์‹œ๋‚˜๋ฆฌ์˜ค

์ž…๋ ฅ: 2025-10-18 ๋ผ์ธ2 ์ˆ˜์œจ ๊ธ‰๋ฝ

Anomaly-Agent: OvenTemp(±8โ„ƒ), Speed(1.1m/s) ๊ฐ์ง€
RootCause-Agent: Corr(Temp,Yield)=−0.81 → ์ฃผ์š”์›์ธ Temp
RAG-Agent: SOP-HT-221 §3.2 ์ธ์šฉ (ํ—ˆ์šฉ ±5โ„ƒ)
Action-Agent: “์˜จ๋„ PID ์žฌํŠœ๋‹ ๋ฐ ์„ผ์„œ ์ ๊ฒ€”
Report-Agent: ๊ทผ๊ฑฐ ํฌํ•จ ๋ณด๊ณ ์„œ ์™„์„ฑ


โ‘ค LangGraph ์›Œํฌํ”Œ๋กœ ์˜ˆ์‹œ

nodes:
  - anomaly_agent
  - rootcause_agent
  - rag_agent
  - action_agent
  - report_agent
edges:
  - anomaly_agent -> rootcause_agent
  - rootcause_agent -> rag_agent
  - rag_agent -> action_agent
  - action_agent -> report_agent
  • ๊ฐ ๋…ธ๋“œ์˜ ์ถœ๋ ฅ์€ JSON ํ˜•ํƒœ๋กœ ์ „๋‹ฌ:
 
{
  "variable": "OvenTemp",
  "deviation": 8,
  "impact": 0.81,
  "sop_reference": "SOP-HT-221 §3.2",
  "recommended_action": "Adjust PID controller"
}

โ‘ฅ ๊ธฐ์ˆ ์  ํฌ์ธํŠธ

์˜์—ญ๊ธฐ์ˆ ์„ค๋ช…
์ด์ƒํƒ์ง€ IsolationForest / TCN ์‹ค์‹œ๊ฐ„ ์ด์ƒ ๊ฐ์ง€
์›์ธ์ถ”๋ก  SHAP, CausalImpact ๋ณ€์ˆ˜ ์˜ํ–ฅ๋„ ์ถ”์ •
๊ทผ๊ฑฐ๊ฒ€์ƒ‰ BM25+pgvector RAG SOP/WI ์ธ์šฉ
์กฐ์น˜์ƒ์„ฑ LLM (Instruction-tuned) ์ž์—ฐ์–ด ์กฐ์น˜ ์ƒ์„ฑ
์ฒด์ธ๊ด€๋ฆฌ LangGraph ํ”Œ๋กœ์šฐ ๋ฐ ์žฌ์‹œ๋„ ๊ด€๋ฆฌ

โ‘ฆ ํ‰๊ฐ€ ํฌ์ธํŠธ

  • ๋‹จ๊ณ„๋ณ„ ์ž…๋ ฅ/์ถœ๋ ฅ ๊ตฌ์กฐ๋ฅผ ๋ช…ํ™•ํžˆ ์ œ์‹œํ–ˆ๋Š”๊ฐ€
  • RootCause-Agent๊ฐ€ ์ˆ˜์น˜์ /๋ฌธํ—Œ์  ๊ทผ๊ฑฐ๋ฅผ ๊ฒฐํ•ฉํ•˜๋Š” ๊ตฌ์กฐ๋ฅผ ์„ค๋ช…ํ–ˆ๋Š”๊ฐ€
  • ์ตœ์ข… ๋ฆฌํฌํŠธ ์ƒ์„ฑ๊นŒ์ง€์˜ Reasoning Chain์„ ๋…ผ๋ฆฌ์ ์œผ๋กœ ๊ตฌ์„ฑํ–ˆ๋Š”๊ฐ€โ‘  ๋ชฉ์ 
    • ์ œ์กฐ ๋ถˆ๋Ÿ‰์˜ ์›์ธ์€ ๋‹จ์ผ ๋ณ€์ˆ˜๊ฐ€ ์•„๋‹Œ ๋‹ค์ˆ˜์˜ ์ƒํ˜ธ์ž‘์šฉ ๋ณ€์ˆ˜์— ์˜ํ•ด ๋ฐœ์ƒ.
    • LLM์ด ๋ฐ์ดํ„ฐ๋ฅผ ๊ทผ๊ฑฐ๋กœ ๋…ผ๋ฆฌ์  Reasoning Chain์„ ๋”ฐ๋ผ๊ฐ€๋ฉฐ
      ์›์ธ์„ ์„ค๋ช…ํ•˜๊ณ  ๊ทผ๊ฑฐ ๋ฌธ์„œ๋ฅผ ์ธ์šฉํ•ด์•ผ ํ•จ.

    โ‘ก ์ „์ฒด ๊ตฌ์กฐ
    โ‘ข ๋‹จ๊ณ„๋ณ„ ์—ญํ• 
    โ‘ฃ ์˜ˆ์‹œ ์‹œ๋‚˜๋ฆฌ์˜ค
    โ‘ค LangGraph ์›Œํฌํ”Œ๋กœ ์˜ˆ์‹œ
    • ๊ฐ ๋…ธ๋“œ์˜ ์ถœ๋ ฅ์€ JSON ํ˜•ํƒœ๋กœ ์ „๋‹ฌ:
     
    {
      "variable": "OvenTemp",
      "deviation": 8,
      "impact": 0.81,
      "sop_reference": "SOP-HT-221 §3.2",
      "recommended_action": "Adjust PID controller"
    }

    โ‘ฅ ๊ธฐ์ˆ ์  ํฌ์ธํŠธ
    โ‘ฆ ํ‰๊ฐ€ ํฌ์ธํŠธ
    • ๋‹จ๊ณ„๋ณ„ ์ž…๋ ฅ/์ถœ๋ ฅ ๊ตฌ์กฐ๋ฅผ ๋ช…ํ™•ํžˆ ์ œ์‹œํ–ˆ๋Š”๊ฐ€
    • RootCause-Agent๊ฐ€ ์ˆ˜์น˜์ /๋ฌธํ—Œ์  ๊ทผ๊ฑฐ๋ฅผ ๊ฒฐํ•ฉํ•˜๋Š” ๊ตฌ์กฐ๋ฅผ ์„ค๋ช…ํ–ˆ๋Š”๊ฐ€
    • ์ตœ์ข… ๋ฆฌํฌํŠธ ์ƒ์„ฑ๊นŒ์ง€์˜ Reasoning Chain์„ ๋…ผ๋ฆฌ์ ์œผ๋กœ ๊ตฌ์„ฑํ–ˆ๋Š”๊ฐ€
  • ์˜์—ญ๊ธฐ์ˆ ์„ค๋ช…
    ์ด์ƒํƒ์ง€ IsolationForest / TCN ์‹ค์‹œ๊ฐ„ ์ด์ƒ ๊ฐ์ง€
    ์›์ธ์ถ”๋ก  SHAP, CausalImpact ๋ณ€์ˆ˜ ์˜ํ–ฅ๋„ ์ถ”์ •
    ๊ทผ๊ฑฐ๊ฒ€์ƒ‰ BM25+pgvector RAG SOP/WI ์ธ์šฉ
    ์กฐ์น˜์ƒ์„ฑ LLM (Instruction-tuned) ์ž์—ฐ์–ด ์กฐ์น˜ ์ƒ์„ฑ
    ์ฒด์ธ๊ด€๋ฆฌ LangGraph ํ”Œ๋กœ์šฐ ๋ฐ ์žฌ์‹œ๋„ ๊ด€๋ฆฌ
  • nodes:
      - anomaly_agent
      - rootcause_agent
      - rag_agent
      - action_agent
      - report_agent
    edges:
      - anomaly_agent -> rootcause_agent
      - rootcause_agent -> rag_agent
      - rag_agent -> action_agent
      - action_agent -> report_agent
  • ์ž…๋ ฅ: 2025-10-18 ๋ผ์ธ2 ์ˆ˜์œจ ๊ธ‰๋ฝ
  • Anomaly-Agent: OvenTemp(±8โ„ƒ), Speed(1.1m/s) ๊ฐ์ง€
    RootCause-Agent: Corr(Temp,Yield)=−0.81 → ์ฃผ์š”์›์ธ Temp
    RAG-Agent: SOP-HT-221 §3.2 ์ธ์šฉ (ํ—ˆ์šฉ ±5โ„ƒ)
    Action-Agent: “์˜จ๋„ PID ์žฌํŠœ๋‹ ๋ฐ ์„ผ์„œ ์ ๊ฒ€”
    Report-Agent: ๊ทผ๊ฑฐ ํฌํ•จ ๋ณด๊ณ ์„œ ์™„์„ฑ
  • Agent์ž…๋ ฅ์ฒ˜๋ฆฌ ๋กœ์ง์ถœ๋ ฅ
    Anomaly-Agent ์‹œ๊ณ„์—ด ๋ฐ์ดํ„ฐ Isolation Forest / TCN ์ด์ƒ ๊ตฌ๊ฐ„ (time range, variables)
    RootCause-Agent ์ด์ƒ ๊ตฌ๊ฐ„ ๋ฐ์ดํ„ฐ ์ƒ๊ด€๋ถ„์„, SHAP, Causal Inference ์ฃผ์š” ๋ณ€์ˆ˜·์˜ํ–ฅ๋„
    RAG-Agent ๋ณ€์ˆ˜๋ช…, ๊ณต์ •๋ช… ๋ฌธํ—Œ·SOP ๊ฒ€์ƒ‰ ๊ด€๋ จ ์ ˆ์ฐจ/ํ—ˆ์šฉ๋ฒ”์œ„
    Action-Agent ์›์ธ+SOP ๋‚ด์šฉ ์กฐ์น˜ ์ œ์•ˆ ์ƒ์„ฑ ์กฐ์น˜ ํ…์ŠคํŠธ
    Report-Agent ๋ชจ๋“  ๊ฒฐ๊ณผ ๋ฆฌํฌํŠธ ํ†ตํ•ฉ PDF/DOCX ๋ณด๊ณ ์„œ
  • [Sensor Data] โ”€โ”€> Anomaly-Agent
                        ↓
                    RootCause-Agent
                        ↓
                    RAG-Agent
                        ↓
                    Action-Agent
                        ↓
                    Report-Agent
๋ฐ˜์‘ํ˜•
๋ฐ˜์‘ํ˜•

์ œ์กฐ๊ณต์ •์˜ ๋ฌธ์„œ์™€ ๋ฐ์ดํ„ฐ ๊ฐ„ ๊ด€๊ณ„๋ฅผ ์ฒด๊ณ„์ ์œผ๋กœ ๊ด€๋ฆฌํ•˜๊ธฐ ์œ„ํ•ด
Knowledge Graph (KG) ์™€ RAG ๋ฅผ ๊ฒฐํ•ฉํ•œ ์ง€์‹ ๊ธฐ๋ฐ˜ Agentic AI๋ฅผ ๊ตฌ์ถ•ํ•˜๋ ค ํ•œ๋‹ค.

๋‘ ์‹œ์Šคํ…œ์˜ ์—ญํ• ์„ ๋น„๊ตํ•˜๊ณ ,
KG ๊ธฐ๋ฐ˜ RAG Retrieval ๊ตฌ์กฐ ๋ฐ ์ถ”๋ก  ํ๋ฆ„์„ ์„ค๊ณ„ํ•˜์—ฌ ์„ค๋ช…ํ•˜์‹œ์˜ค.

 

 

 

โ‘  ๊ฐœ๋… ๋น„๊ต

ํ•ญ๋ชฉRAGKnowledge Graph
๋ชฉ์  ๋ฌธ์„œ ๊ธฐ๋ฐ˜ ์˜๋ฏธ ๊ฒ€์ƒ‰ ์—”ํ‹ฐํ‹ฐ ๊ฐ„ ๊ด€๊ณ„ ์ถ”๋ก 
๋‹จ์œ„ ๋ฌธ์žฅ/๋ฌธ๋‹จ ๋…ธ๋“œ/์—ฃ์ง€(์—”ํ‹ฐํ‹ฐ ๊ด€๊ณ„)
์žฅ์  ๋น ๋ฅธ ๊ฒ€์ƒ‰·์š”์•ฝ ๋…ผ๋ฆฌ์  ๊ด€๊ณ„ ๊ธฐ๋ฐ˜ ์ถ”๋ก 
ํ•œ๊ณ„ ๋ฌธ๋งฅ ๋‹จ์ ˆ ๋Œ€๊ทœ๋ชจ ๊ตฌ์ถ• ๋น„์šฉ

→ ๊ฒฐํ•ฉ ์‹œ “๊ฒ€์ƒ‰ + ์ถ”๋ก ”์ด ๋™์‹œ์— ๊ฐ€๋Šฅ.


โ‘ก ํ†ตํ•ฉ ๊ตฌ์กฐ ๊ฐœ์š”

 
[๋ฌธ์„œ/DB] → [Chunking + Embedding] → RAG Index(OpenSearch + pgvector)
[์ง€์‹๋งต] → [Entity/Relation ์ถ”์ถœ] → Knowledge Graph (Neo4j)
       ↓
   KG-RAG Fusion Retriever
       ↓
  Reasoning LLM (LangGraph Agent)
       ↓
  ์ธ์šฉ + ๊ด€๊ณ„๊ธฐ๋ฐ˜ ๋ณด๊ณ ์„œ ์ƒ์„ฑ

โ‘ข Fusion Retrieval ์›๋ฆฌ

  1. Query๋ฅผ Entity ๋ฐ Relation์œผ๋กœ ํŒŒ์‹ฑ (์˜ˆ: “์˜จ๋„ ํŽธ์ฐจ๊ฐ€ ์ˆ˜์œจ์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ”)
  2. KG์—์„œ ์—ฐ๊ฒฐ๋œ ์—”ํ‹ฐํ‹ฐ ํƒ์ƒ‰:
    MATCH (p:Parameter)-[:AFFECTS]->(k:KPI {name:'Yield'}) RETURN p.name, p.importance
  3. ์—ฐ๊ด€ ์—”ํ‹ฐํ‹ฐ ํ‚ค์›Œ๋“œ๋ฅผ RAG ๊ฒ€์ƒ‰์–ด์— ์ถ”๊ฐ€:
    query_terms = ["temperature deviation", "yield loss", "oven parameter"]
  4. ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ๊ฒ€์ƒ‰(BM25+Vector) ์ˆ˜ํ–‰ → ๊ทผ๊ฑฐ ๋ฌธ๋‹จ ์ˆ˜์ง‘.
  5. LLM์ด KG ๊ด€๊ณ„ + ๋ฌธ์„œ ๊ทผ๊ฑฐ๋ฅผ ํ•จ๊ป˜ ์ธ์šฉํ•˜์—ฌ ๋‹ต๋ณ€ ์ƒ์„ฑ.

โ‘ฃ ์˜ˆ์‹œ ์‘๋‹ต ๊ตฌ์กฐ

์งˆ๋ฌธ: ์˜ค๋ธ ์˜จ๋„ ํŽธ์ฐจ๊ฐ€ ์ˆ˜์œจ์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ์€?
RAG ์ธ์šฉ: “SOP-HT-221 §3.2: ±5โ„ƒ ์ดˆ๊ณผ ์‹œ ํ’ˆ์งˆ ์ €ํ•˜ ๋ฐœ์ƒ.”
KG ๊ด€๊ณ„: (TemperatureDeviation)–AFFECTS–(YieldLoss) weight=0.86
AI ๋‹ต๋ณ€: “์˜จ๋„ ํŽธ์ฐจ๋Š” ์ˆ˜์œจ ์ €ํ•˜์˜ ์ฃผ์š” ์›์ธ(์ƒ๊ด€๋„ 0.86)์œผ๋กœ SOP-HT-221์—์„œ ํ—ˆ์šฉ๋ฒ”์œ„๋ฅผ ±5โ„ƒ๋กœ ์ œํ•œํ•˜๊ณ  ์žˆ๋‹ค.”


โ‘ค ๊ธฐ์ˆ  ๊ตฌ์„ฑ์š”์†Œ

๋ชจ๋“ˆ๊ธฐ์ˆ ์„ค๋ช…
Entity Extractor spaCy, Llama3 NER KPI·Parameter·Equipment ์ถ”์ถœ
Graph DB Neo4j, ArangoDB ๊ด€๊ณ„ ์ €์žฅ·ํƒ์ƒ‰
KG-RAG Fusion Custom Retriever KG ๊ธฐ๋ฐ˜ Query Expansion
Agent LangGraph RAG + KG ๋ณ‘ํ•ฉ ์ถ”๋ก 
Visualization NeoDash, Grafana ๊ด€๊ณ„ ์‹œ๊ฐํ™”

โ‘ฅ ์‹ค๋ฌดํšจ๊ณผ

ํ•ญ๋ชฉ๊ธฐ์กด RAGKG+RAG
์ธ์šฉ ํ’ˆ์งˆ ๋ฌธ์žฅ ๊ธฐ๋ฐ˜ ๊ด€๊ณ„ ๊ธฐ๋ฐ˜ ๊ทผ๊ฑฐ ๋ณด๊ฐ•
๋„๋ฉ”์ธ ์ดํ•ด ์•ฝํ•จ ์—”ํ‹ฐํ‹ฐ ๊ด€๊ณ„ ๊ธฐ๋ฐ˜ ์‹ฌํ™”
์žฌํ˜„์„ฑ ์ค‘๊ฐ„ ๋†’์Œ
ํ™•์žฅ์„ฑ ๋ฌธ์„œ ์ฆ๊ฐ€ ์˜ํ–ฅ ํผ ์—”ํ‹ฐํ‹ฐ ๊ด€๊ณ„๋งŒ ์ถ”๊ฐ€๋กœ ํ™•์žฅ ์šฉ์ด

โ‘ฆ ํ‰๊ฐ€ ํฌ์ธํŠธ

  • RAG์™€ KG์˜ ์ฐจ์ด๋ฅผ ๋ช…ํ™•ํžˆ ์„ค๋ช…ํ–ˆ๋Š”๊ฐ€
  • KG ๊ธฐ๋ฐ˜ Query Expansion ๊ตฌ์กฐ๋ฅผ ๊ตฌ์ฒดํ™”ํ–ˆ๋Š”๊ฐ€
  • Fusion Retriever์˜ ์ถ”๋ก  ์ ˆ์ฐจ๋ฅผ ๋‹จ๊ณ„๋ณ„๋กœ ์ œ์‹œํ–ˆ๋Š”๊ฐ€
๋ฐ˜์‘ํ˜•

+ Recent posts