Skip to content
View daeun-ops's full-sized avatar

Block or report daeun-ops

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
daeun-ops/README.md

๐Ÿ‘‹ HELLO WORLD

I'm Daeun (Sophie) Kim ๐Ÿ’™

Cloud / Platform / Observability Engineer
Backend โ†’ Kubernetes โ†’ Production Reliability
Now teaching cloud-native engineering (2024.03 ~ now)

I build...! reliable infra, observable systems, and engineers who can run them.

Tech Stack

IaC / GitOps

Kubernetes / Cloud

Observability

LLM / AI

Languages / Frameworks

Messaging / DB

Security / Policy

Collaboration


About Me[Eng]

Who I am

Iโ€™m not โ€œsomeone who deploys YAML.โ€
Iโ€™m someone who has killed production by accident, revived it at 3AM, and then made sure it wonโ€™t wake anyone up again.

  • 2 years of hands-on work as a Software / Platform Engineer
    (backend services, AWS, containers, Kubernetes monitoring products)
  • Currently a technical instructor for cloud-native engineering bootcamps
    (Java/Spring, Cloud, Kubernetes, IaC, CI/CD, GitOps, Ops culture)

I focus on:

  • Production reliability (not just โ€œit runs,โ€ but โ€œit survivesโ€)
  • Observability as a first-class requirement
  • Teaching people how to operate, not just deploy

I build environments you can trust, even if youโ€™re half-asleep and on-call.


Experience Snapshot

Software / Platform Engineer

  • Built and operated e-commerce backend microservices on AWS.
  • Joined a Kubernetes monitoring product team:
    • Provisioned and maintained multiple K8s environments for agent developers and QA.
    • Ran agent load tests to measure CPU/memory/network impact.
    • Tuned resource usage so customer clusters stayed stable.
  • Reverse-engineered competitors, identified feature gaps, and turned them into roadmap items and demos.
  • Supported on-prem / customer installs and led hands-on troubleshooting sessions.

Technical Instructor (Mar 2024 โ†’ present)

  • Teach Java / Spring Boot, Cloud (AWS/Azure/GCP), Docker/K8s, IaC, CI/CD, GitOps.
  • Train students on delivery the way real product teams ship: planning โ†’ release โ†’ monitoring โ†’ RCA โ†’ iteration.
  • Mission-driven: proving (with execution, not degrees) that nontraditional engineers can deliver production-grade work โ€” and even mentor others.

This is SophieLabs Infrastructure Demo

A personal lab where I rebuild โ€œseriousโ€ infra from scratch โ€” as code.

  • IaC + GitOps + Observability stack bootstrapped end-to-end
  • SLO / SLI / Error Budget culture simulated like a real on-call team
  • FinOps / DR / Policy / Supply Chain Security wired in from day 0

Most Active
Most Active
Most Active


Ongoing Projects

Project Link What it is
Digital Asset Exchange Infra 2025-demo-01 24/365 โ€œnever stop tradingโ€ infra: EKS, MSK(Kafka), Aurora, ClickHouse, Istio, ArgoCD. Multi-repo, infra-as-code, Binance-style reliability simulation.
PromQL Assistant CLI
Text โ†’ PromQL via LLM
promql-assistant-cli Turn โ€œshow pods above 90% CPUโ€ into valid PromQL. Built for on-call humans who don't want to fight dashboards at 2AM.
hello-ebpf-demo
Kernel tracing / eBPF
hello-ebpf-demo Trace kernel-level events with eBPF and ship them to user space via a Go loader. For performance / runtime visibility.
LLM Observability Stack
Datadog + OTel + Llama3 + Grafana
datadog-llm-workshop Treat LLM pipelines like production systems: latency, token cost, RAG path, failure hotspots โ€” all observable.
Hybrid MLOps Platform hybrid-mlops-demo Cloud + On-Prem ML pipeline. Airflow, MLflow, Ray Serve (GPU), EKS. Training + inference + metrics in one workflow.

Study Log

Area Repo Notes
Terraform / IaC Lab terraform-playground Terraform experiments for network/cluster provisioning and multi-env patterns.
Kubernetes Lab kubernetes-playground Namespace strategy, ArgoCD sync, multi-cluster ops patterns.

Motto

โ€œPerfect software doesnโ€™t exist.
Reliable infrastructure does โ€” and it must be code.
And that code must prove itself through observability.โ€
โ€” Sophie


About Me[Kor]

๋‚˜๋Š” ์–ด๋–ค ์‚ฌ๋žŒ์ธ๊ฐ€์š”?!

์ €๋Š” โ€œKubernetes ์ข€ ๋งŒ์ ธ๋ดค์–ด์š”โ€ ํ•˜๋Š” ์‚ฌ๋žŒ์ด ์•„๋‹ˆ์—์š”.
์ €๋Š” ์„œ๋น„์Šค๋ฅผ ํ•œ๋ฒˆ์€ ๋ง๊ฐ€๋œจ๋ ค๋ณด๊ณ , ์ง์ ‘ ์‚ด๋ ค๋ณด๊ณ ,
๊ทธ ๋‹ค์Œ์—” ๋‹ค์‹œ๋Š” ์ƒˆ๋ฒฝ์— ์•„๋ฌด๋„ ์•ˆ ๊นจ๋„ ๋˜๊ฒŒ ๋งŒ๋“œ๋Š” ์‚ฌ๋žŒ์ด ๋˜๋ ค๊ณ  ๋ชฉ์ˆจ์„ ๊ฒ๋‹ˆ๋‹ค!

  • Backend Engineer & Platform Engineer ๊ฒฝ๋ ฅ 2๋…„
    (์ „์ž์ƒ๊ฑฐ๋ž˜ MSA on AWS โ†’ ์ฟ ๋ฒ„๋„คํ‹ฐ์Šค ๋ชจ๋‹ˆํ„ฐ๋ง ์ œํ’ˆํŒ€์œผ๋กœ ์ด์ง)
  • 2024.03๋ถ€ํ„ฐ๋Š” ์‹ค์ œ ๋ถ€ํŠธ์บ ํ”„์—์„œ ๊ธฐ์ˆ  ๊ฐ•์˜ ์ค‘
    (Java/Spring, ํด๋ผ์šฐ๋“œ, ์ฟ ๋ฒ„๋„คํ‹ฐ์Šค, IaC, CI/CD, GitOps, ์šด์˜๋ฌธํ™”๊นŒ์ง€)

์ œ๊ฐ€ ์ง‘์š”ํ•˜๊ฒŒ ์ง‘์ฐฉํ•˜๋Š” ๊ฒƒ์€!

  • โ€œ์ผ๋‹จ ๋Œ์•„๊ฐ„๋‹คโ€๊ฐ€ ์•„๋‹ˆ๋ผ โ€œํ„ฐ์ ธ๋„ ์‚ฐ๋‹คโ€
  • Observability๋Š” ๊ธฐ๋Šฅ์ด ์•„๋‹ˆ๋ผ ์š”๊ตฌ์‚ฌํ•ญ์œผ๋กœ ๋‘๋Š” ๊ฒƒ
  • ๋ฐฐํฌ ๋ฐฉ๋ฒ•์ด ์•„๋‹ˆ๋ผ ์šด์˜ ๋ฐฉ๋ฒ•๊นŒ์ง€ ๊ฐ€๋ฅด์น˜๋Š” ๊ฒƒ

โ€œ๋ˆ„๊ฐ€ ์ƒˆ๋ฒฝ 3์‹œ์— ๊นจ์›Œ๋„ ๋ฏฟ๊ณ  ๋งก๊ธธ ์ˆ˜ ์žˆ๋Š” ํ™˜๊ฒฝโ€
๊ทธ๊ฑธ ์„ค๊ณ„ํ•˜๊ณ  ๋งŒ๋“ค๊ณ ์ž ํƒœ์–ด๋‚œ ์‚ฌ๋žŒ์ž…๋‹ˆ๋‹ค.


๊ฒฝํ—˜ ์š”์•ฝ

Backend Engineer & Platform Engineer

  • AWS ๊ธฐ๋ฐ˜ ์ „์ž์ƒ๊ฑฐ๋ž˜ MSA ๋ฐฑ์—”๋“œ ๊ตฌ์ถ• ๋ฐ ์šด์˜
  • ์ดํ›„ K8s ๋ชจ๋‹ˆํ„ฐ๋ง ์ œํ’ˆํŒ€์œผ๋กœ ์ด์ง
    • Agent ๊ฐœ๋ฐœ์„ ์œ„ํ•œ ์—ฌ๋Ÿฌ K8s์ œํ’ˆ ๊ตฌ์ถ•์„ ํ†ตํ•ด ๊ฐœ๋ฐœ ํ™˜๊ฒฝ ์ œ๊ณต
    • Agent ๋ฆฌ์†Œ์Šค Usage (memory/CPU/network) ๋ถ€ํ•˜ ํ…Œ์ŠคํŠธ
    • ๋ฆฌ์†Œ์Šค Tuningํ•ด์„œ ๊ณ ๊ฐ์‚ฌ Cluster ์•ˆ์ •์„ฑ ์œ ์ง€
  • ๊ฒฝ์Ÿ์‚ฌ ์†”๋ฃจ์…˜ ๋ถ„์„ โ†’ ๊ธฐ๋Šฅ ๊ฒฉ์ฐจ ์ •์˜ โ†’ ๊ธฐ๋Šฅ ๊ฐœ์„ ์•ˆ / Demo๊นŒ์ง€ ์—ฐ๊ฒฐ
  • On-prem ๊ณ ๊ฐ์‚ฌ ํ™˜๊ฒฝ ์„ค์น˜ ์ง€์›, ๋ผ์ด๋ธŒ ํŠธ๋Ÿฌ๋ธ”์ŠˆํŒ… ์ฐธ์—ฌ (์ง„์งœ ์ „์Ÿํ„ฐ)

๋ถ€ํŠธ์บ ํ”„ ๊ฐ•์‚ฌ (2024.03 ~ ์ง„ํ–‰ ์ค‘)

  • Java / Spring Boot ๋ฐฑ์—”๋“œ, ํผ๋ธ”๋ฆญ ํด๋ผ์šฐ๋“œ(AWS/Azure/GCP), Docker / Kubernetes, IaC, CI/CD, GitOps ๊ต์œก
  • ์‹ค์ œ ํ”„๋กœ๋•ํŠธ ํŒ€์˜ ์ผํ•˜๋Š” ์ˆœ์„œ๋ฅผ ๊ทธ๋Œ€๋กœ ๊ฐ€๋ฅด์นจ
    ๊ธฐํš โ†’ ๋ฐฐํฌ โ†’ ๋ชจ๋‹ˆํ„ฐ๋ง โ†’ RCA โ†’ ๊ฐœ์„ 
  • ๋ชฉํ‘œ๋Š” โ€œ๋ฐฐํฌ ๋ฒ„ํŠผ ๋ˆ„๋ฅผ ์ˆ˜ ์žˆ๋Š” ์‚ฌ๋žŒโ€์ด ์•„๋‹ˆ๋ผ
    โ€œ์„œ๋น„์Šค๋ฅผ ์ฑ…์ž„์งˆ ์ˆ˜ ์žˆ๋Š” ์‚ฌ๋žŒโ€์„ ๋งŒ๋“œ๋Š” ๊ฒƒ

์ €๋Š” ๋น„์ „๊ณต์ž ์ถœ์‹ ๋„ ์‹ค์ œ ํ”„๋กœ๋•์…˜์„ ์ฑ…์ž„์งˆ ์ˆ˜ ์žˆ๋‹ค๋Š” ๊ฑธ ์ œ ๊ฒฝ๋ ฅ์œผ๋กœ ์ฆ๋ช…ํ•˜๊ณ  ์‹ถ์Šต๋‹ˆ๋‹ค.

์ €๋Š” ์‹คํ–‰๋ ฅ๊ณผ ์ž„ํŒฉํŠธ๋กœ ๊ฒฐ๊ณผ๋ฅผ ๋งŒ๋“ค์–ด๋‚ด๊ณ , ๋™์‹œ์— ๋‹ค๋ฅธ ์‚ฌ๋žŒ์„ ๊ฐ€๋ฅด์น  ์ˆ˜๋„ ์žˆ๋‹ค๋Š” ๊ฑธ ๋ณด์—ฌ์ค„ ์ˆ˜ ์žˆ๋Š” ์—ญํ• ์— ํŠนํžˆ ๊ด€์‹ฌ์ด ์žˆ์Šต๋‹ˆ๋‹ค. ๊ฐ•์‚ฌ๋กœ์„œ์˜ ์ œ ์ผ์€ ๊ทธ ๋ฏธ์…˜์˜ ์ผ๋ถ€์ž…๋‹ˆ๋‹ค. ์ €๋Š” ํ•™์ƒ๋“ค์ด ์‹ค์ œ ํ”„๋กœ๋•์…˜ ์ˆ˜์ค€์˜ ์—ญ๋Ÿ‰์„ ๊ฐ–์ถ”๋„๋ก ๋•๊ณ , โ€˜๋ฐฐ๊ฒฝ์ด ๋‹ค๋ฅด๋‹คโ€™๋Š” ์ด์œ ๋งŒ์œผ๋กœ ๊ธฐ์ˆ ์  ๊นŠ์ด๋‚˜ ๋ฆฌ๋”์‹ญ์ด ์ œํ•œ๋˜์ง€ ์•Š๋Š”๋‹ค๋Š” ๊ฑธ ์ฆ๋ช…ํ•˜๊ธฐ ์œ„ํ•ด ๊ต์œก ํ˜„์žฅ์—์„œ ๊ฒฝํ—˜์„ ์Œ“์•„์™”์Šต๋‹ˆ๋‹ค.


SophieLabs ์ธํ”„๋ผ ์‹คํ—˜์‹ค

SophieLabs ๋Š” ์ œ๊ฐ€ ์ง์ ‘ ๋งŒ๋“œ๋Š” ๊ฐœ์ธ ์—ฐ๊ตฌ ํ™˜๊ฒฝ์ž…๋‹ˆ๋‹ค.
๋ชฉํ‘œ๋Š” ๊ฐ„๋‹จํ•ด์š”:
โ€œ์ง„์งœ ํšŒ์‚ฌ๋ฅผ ํ‰๋‚ด๋‚ด์ง€ ๋ง๊ณ , ๊ทธ๋ƒฅ ๋‚ด๊ฐ€ ํšŒ์‚ฌ์ฒ˜๋Ÿผ ๊ตด๋ฆฌ์ž.โ€

  • IaC + GitOps + Observability ์ „์ฒด ํŒŒ์ดํ”„๋ผ์ธ ์ž๋™ํ™”
  • SLO / SLI / Error Budget ๊ฐ™์€ ์šด์˜ ๋ฌธํ™”๊นŒ์ง€ ์ฝ”๋“œ๋กœ ์‹œ๋ฎฌ๋ ˆ์ด์…˜
  • FinOps / DR / Policy / Supply Chain Security ๋ฅผ ์ดˆ๋ฐ˜๋ถ€ํ„ฐ ๊ตฌ์กฐ ์•ˆ์— ์‹ฌ๋Š” ๋ฐฉ์‹ ์—ฐ๊ตฌ

Today Most Active
Today Most Active
Today Most Active


์ง„ํ–‰ ์ค‘์ธ ํ”„๋กœ์ ํŠธ

ํ”„๋กœ์ ํŠธ ๋งํฌ ์„ค๋ช…
Digital Asset Exchange Infra 2025-demo-01 24/365 ๋ฉˆ์ถ”์ง€ ์•Š๋Š” ๊ฐ€์ƒ์ž์‚ฐ ๊ฑฐ๋ž˜์†Œ ์ธํ”„๋ผ ์‹คํ—˜. EKS, MSK(Kafka), Aurora, ClickHouse, Istio, ArgoCD ๋“ฑ ์ „์ฒด ๊ตฌ์„ฑ์„ ์ฝ”๋“œ๋กœ ๊ด€๋ฆฌ. (10๊ฐœ ์ด์ƒ ๋ ˆํฌ ๊ตฌ์กฐ)
PromQL Assistant CLI
์ž์—ฐ์–ด โ†’ PromQL
promql-assistant-cli โ€œCPU 90% ๋„˜์€ ํŒŒ๋“œ ๋ˆ„๊ตฌ์•ผ?โ€ ๊ฐ™์€ ๋ฌธ์žฅ์„ ๊ณง๋ฐ”๋กœ PromQL๋กœ ๋ฐ”๊ฟ”์ฃผ๋Š” CLI. ์ƒˆ๋ฒฝ ์˜จ์ฝœ ์š”์› ์‚ด๋ฆฌ๋Š” ๋„๊ตฌ.
hello-ebpf-demo
์ปค๋„ ํŠธ๋ ˆ์ด์‹ฑ / eBPF
hello-ebpf-demo eBPF๋กœ ์ปค๋„ ๋ ˆ๋ฒจ ์ด๋ฒคํŠธ๋ฅผ ์ถ”์ ํ•˜๊ณ  Go ๋กœ๋”๋ฅผ ํ†ตํ•ด ์œ ์ € ๊ณต๊ฐ„์œผ๋กœ ์ „๋‹ฌ. ์„ฑ๋Šฅ/๋ณด์•ˆ ๊ฐ€์‹œ์„ฑ ํ™•๋ณด ๋ชฉ์ .
LLM Observability Stack
Datadog + OTel + Llama3 + Grafana
datadog-llm-workshop LLM ํ˜ธ์ถœ ์ฒด์ธ์„ ๊ทธ๋ƒฅ โ€œAI ๋งˆ๋ฒ•โ€์œผ๋กœ ๋‘์ง€ ์•Š๊ณ , ์ง€์—ฐ / ํ† ํฐ ๋น„์šฉ / RAG ๊ฒฝ๋กœ / ์‹คํŒจ ์ง€์ ์„ ์ „๋ถ€ ๊ฐ€์‹œํ™”.
Hybrid MLOps Platform hybrid-mlops-demo ์˜จํ”„๋ ˆ๋ฏธ์Šค + ํด๋ผ์šฐ๋“œ ํ˜ผํ•ฉ ML ํŒŒ์ดํ”„๋ผ์ธ. Airflow, MLflow, Ray Serve(GPU), EKS๊นŒ์ง€ ํ•œ ์›Œํฌํ”Œ๋กœ์šฐ๋กœ ๋ฌถ์–ด์„œ ํ•™์Šต/์ถ”๋ก /๋ชจ๋‹ˆํ„ฐ๋ง.

ํ•™์Šต ๊ธฐ๋ก

๋ถ„์•ผ Repo ์„ค๋ช…
Terraform / IaC ์‹คํ—˜์‹ค terraform-playground Terraform์œผ๋กœ ๋„คํŠธ์›Œํฌ/ํด๋Ÿฌ์Šคํ„ฐ ๊ตฌ์„ฑ, ๋ฉ€ํ‹ฐํ™˜๊ฒฝ ํŒจํ„ด ์‹คํ—˜.
Kubernetes ์‹คํ—˜์‹ค kubernetes-playground ๋„ค์ž„์ŠคํŽ˜์ด์Šค ์ „๋žต, ArgoCD ๋™๊ธฐํ™” ํŒจํ„ด, ๋ฉ€ํ‹ฐํด๋Ÿฌ์Šคํ„ฐ ์šด์˜ ๋ฐฉ์‹ ๊ฒ€์ฆ.

Pinned Loading

  1. promql-assistant-cli promql-assistant-cli Public

    Python

  2. datadog-llm-workshop datadog-llm-workshop Public

    Datadog Summit Seoul 2025

    Python 1

  3. infra-terraform infra-terraform Public

    Forked from 2025-demo-01/infra-terraform

    HCL

  4. platform-argocd platform-argocd Public

    Forked from 2025-demo-01/platform-argocd

    Shell

  5. svc-trading-api svc-trading-api Public

    Forked from 2025-demo-01/svc-trading-api

    Go

  6. hybrid-mlops-demo hybrid-mlops-demo Public

    Python 1 1