Junghwan Park directs the Division of Healthcare Data Development at Korea's Ministry of Health and Welfare, where he leads national strategy for health data and AI. He earned his PhD in public health at UC San Diego, where he built just-in-time adaptive interventions for physical activity using machine learning and control engineering. He also designs and ships AI and agent systems himself — from local-LLM research harnesses to deployed full-stack tools.
2020–2024 PhD, Public Health (Health Behavior) — UC San Diego / San Diego State University Advisor: Eric Hekler. Dissertation: Optimizing Adaptive Interventions for Physical Activity.
2002–2011 BS, Bio and Brain Engineering (Bioinformatics) — Korea Advanced Institute of Science and Technology (KAIST)
2025 Junghwan Park led the Planning Team for the 15th APEC High-Level Meeting on Health and the Economy, held in Seoul on 16 September 2025 under the theme "Connect, Innovate, Prosper: Building a Healthy, Smart and Aging-Responsive Society." Chaired by Korea's Minister of Health and Welfare, the meeting adopted a Joint Ministerial Statement among APEC economies addressing population aging, sustainable health systems, digital health and AI, mental health, supply-chain resilience, and emergency preparedness. Park steered negotiation of that statement and organized the third Senior Officials' Meeting (SOM-3), carrying the multilateral coordination that produced a consensus regional health-and-economy agenda.
2017–2022 Junghwan Park planned what became Korea's national bio big-data initiative — popularly the One Million Genomes Project, now formally the National Integrated Bio Big Data project (국가통합바이오빅데이터 구축사업). His planning helped lay the groundwork for a multi-ministry national program that, in its full phase launched in December 2024, links genomic, clinical, and public data from consenting citizens, aiming to assemble bio big data from one million people. Led jointly by the Ministry of Health and Welfare, the Ministry of Science and ICT, the Ministry of Trade, Industry and Energy, and the Korea Disease Control and Prevention Agency, it is a foundation for precision medicine.
2024–2025 In 2024–2025, Junghwan Park led a cross-ministerial team spanning twelve government ministries that shaped Korea's response to U.S. pharmaceutical trade actions and its national biosecurity posture, including biopharmaceutical supply chains and bio-clusters. Given the topic's sensitivity, it is described here only in brief.
2026– Beginning in 2026, Junghwan Park led a team of planners and engineers to develop the "AI Basic Health Strategy," a framework intended to provide universal access to AI-driven health initiatives. The strategy spans the full arc from research and development through commercial industry, treating AI in health as a continuum rather than a set of isolated projects. As Director of the Division of Healthcare Data Development, Park brought together policy planning and technical expertise to shape the framework. As a strategy still under development, its detailed content is expected to be published in the future.
2017–2020 Junghwan Park established Korea's National Medical Big Data Linkage Platform (2017–2020), connecting national health-data holders so that previously siloed datasets could be analyzed together for research and policy. The initiative grew into the operating Healthcare Big Data Integrated Platform (보건의료 빅데이터 통합 플랫폼), which links bodies including the Disease Control and Prevention Agency, the National Health Insurance Service, HIRA, the National Cancer Center, and Statistics Korea. It is built around privacy: data is converted to a common model and only analysis results, not raw records, leave the system.
2017–2020 From 2017 to 2020, Junghwan Park led the drafting of Korea's first health-information de-identification guideline — the methodology for removing or masking identifying elements so that sensitive health records could be used for research while protecting privacy. That foundational work seeded what is now the Ministry of Health and Welfare's Healthcare Data Utilization Guideline (보건의료데이터 활용 가이드라인), maintained by its Healthcare Data Promotion Division and periodically revised, most recently in December 2025. The guideline gives hospitals, agencies, and researchers a common standard for handling health data under Korea's privacy regime.
2017–2018 Earlier in his career (2017–2018), Junghwan Park was responsible for regenerative-medicine policy at the Ministry of Health and Welfare, where he drafted an early $1 billion R&D plan and helped lay the legislative groundwork for advanced therapies. That work became part of the foundation for what later grew into the Advanced Regenerative Bio Act (첨단재생의료 및 첨단바이오의약품 안전 및 지원에 관한 법률), in force since 2020, and for the institutions that now carry the field forward — including the Regenerative Medicine Advancement Foundation (재생의료진흥재단). The realized law and programs are the work of many hands over the years since.
2012–2015 From 2012 to 2015, Junghwan Park operated Korea's Social Security Information System and ran a welfare fraud-prevention program credited with an estimated $2.5 billion in savings, alongside automated eligibility verification. Now operated by the Korea Social Security Information Service (한국사회보장정보원) as 행복이음, the system is the integrated backbone of national welfare administration: it consolidates welfare information by individual and household across agencies, serving seven ministries and around 170 welfare programs, and verifies eligibility while linking income and asset data. Park's role was operational — running the program that delivered those results during his tenure.
2025 A public policy-and-engineering project that measures whether South Korea's public-sector websites are reachable by AI crawlers. If you ask a chatbot about a government program, it can only answer accurately if it can read that ministry's site, yet many block AI crawlers via robots.txt. Are We Open audits 734 institutions, from ministries and courts to local governments and public agencies, scoring each out of 100 across six categories: robots.txt rules, real content accessibility, structured data, technical accessibility, llms.txt, and sitemap. It goes past static checks by rendering pages in a real browser, following the actual press-release and notice links, and testing whether those final URLs are blocked. Results are published openly with downloadable data.
2024 Park's 2024 UCSD/SDSU PhD dissertation (Public Health, Health Behavior; chair Eric Hekler) is titled "Optimizing Just-In-Time Adaptive Interventions: Incorporating Idiographic, Dynamic Predictions to Support Physical Activity." Built on the JustWalk system-identification trial, it tests whether individualized "just-in-time" states — moments when a person reliably walks more if prompted — can be identified per person using within-individual Bayesian modeling rather than population-level statistics. The reported finding: such states were identifiable for 91% of participants with sufficient data (40 of 44), justifying a follow-on control optimization trial.
2020–2024 JustWalk JITAI is Park's PhD trial, published as a protocol in JMIR Research Protocols (2023). It is a system-identification experiment designed to test whether "just-in-time" states — moments when a person reliably increases walking if prompted — can be identified empirically rather than assumed. Forty-eight physically inactive adults (aged 25 and older, wearing a Fitbit Versa 3) went through a 10-day baseline plus a 260-day intervention in which adaptive daily step goals and up to four daily notifications were varied using control-engineering signal designs, so that intervention dynamics could be modeled within each individual.
2022–2025 YourMove is a published protocol (JMIR Research Protocols, 2025) for a control optimization trial embedded within a randomized controlled trial, testing an individualized, adaptive physical-activity digital health intervention. The design enrolls 386 inactive adults aged 25–80, randomized 1:1 (stratified by sex) to a control-systems-driven intervention or an active control, over 12 months. Its distinguishing feature is a three-phase structure — baseline, an open-loop system-identification phase that fits each person's own dynamic model, then a closed-loop controller that perpetually adapts goals and rewards. Primary outcome: minutes per week of moderate-to-vigorous physical activity at 12 months.
2025 reasoning-mini asks whether a swarm of sub-1B-parameter language models, run locally, can climb toward the reasoning ability of a frontier model on a benchmark ladder. Each experiment is pinned between three anchors graded on the same seeded problems: a single-model floor, the orchestration under test, and the frontier-model ceiling. The headline result is that diverse-prompt self-consistency lifts GSM8K accuracy from a 56% single-model baseline to 84%, closing 64% of the gap to the frontier model — and that every multi-agent debate, verifier, or decomposition scheme underperformed that simple champion.
2025 Neural Architect tests a provocative idea: can a large language model design and weight-tune a small neural network by reasoning, with no backpropagation? The LLM works through an MCP tool server — inspecting MNIST, proposing layers, assigning neuron roles, setting weights, and reading back evaluations — under a deliberately harsh constraint: maximize accuracy per hidden neuron. Every decision and tool call is logged to SQLite for a fully auditable, explainable build. In practice the strongest gradient-free 8-neuron model reached about 92% test accuracy, essentially matching the backprop baseline at that size.
2025 llmwiki is a personal, multi-domain knowledge base that LLM agents build and maintain from curated sources, instantiating the "LLM wiki" idea at passage granularity. The user curates source material and asks questions; agents write, link, and reorganize the wiki. Its defining choice is that the atomic page is a single passage (roughly one source paragraph), each carrying both the original quote and a paraphrased gist, plus an embedding sidecar and entries in an auto-managed cross-domain ontology. A four-layer pipeline moves material from immutable raw sources through screening and deep-read staging into the structured wiki.
2023–2024 At UC San Diego's Design Lab, within the broader Agile Electrification initiative, Junghwan Park led a sub-project (2023–2024) on machine-learning modeling of home energy-load growth and on improving homeowners' energy literacy. Agile Electrification pursues clean, distributed energy for California homes through design-thinking sprints that produce open-source tools. A central obstacle for the modeling work was utility-bill access; one open-source result predicts a household's annual energy use from building characteristics and weather instead of bills, supporting electrification-impact analysis without that private data.
2005 HExDB (Human EXon DataBase for Alternative Splicing Pattern Analysis) is Park's earliest publication — a 2005 bioinformatics paper in Genomics & Informatics (Park J, Lee M, Bhak J), from his KAIST years. It is a human exon database built to support two practical tasks: designing primers to amplify exons from cDNA, and understanding how alternative splicing changes open reading frames. The database was assembled by integrating three sources — the computationally predicted AltExtron exon set, Ensembl cDNA annotation, and a then-recent Affymetrix genome tiling array — to let researchers examine exon composition and splicing variation across the human genome.
2025–2026 A self-hosted "second brain" backend for both people and LLM agents, exposed as a single REST API and running live in production. One PostgreSQL instance carries everything — relational tables, a graph layer (Apache AGE), and vector embeddings (pgvector) — so semantic search and graph traversal share one source of truth. Nodes and typed edges store notes, papers, and syntheses; local embeddings power meaning-based search; service-account Bearer tokens enforce strict cross-account isolation. Written in Go, deployed via Docker on a single VM behind automatic TLS, with batch endpoints, incremental sync, shareable web views of nodes, and a daily autonomous self-improvement loop.
2025–2026 A self-completing research feedback system that grows toward a queryable corpus of biomedical literature. Snowball search pulls candidate papers from PubMed Central, preprint servers, and Semantic Scholar's citation graph; a two-layer deduplicator (identifier hard-skip plus semantic abstract similarity) prevents re-parsing; a frozen local pipeline writes one English reading note per paper using a local Gemma model. Everything lands in a shared graph store as nodes and edges with embeddings. Periodically, an Opus orchestrator synthesizes topic clusters into summaries, gap maps, and review drafts, independently re-verifying every citation and editing source notes when discrepancies surface.
2025 A hierarchical multi-agent framework that thinks continuously instead of waiting for prompts, built to run on small local models (roughly 7B–30B parameters). A non-agent FastAPI backend absorbs orchestration complexity so the model only writes plain text with simple markers. Three agent tiers — a persistent Telegram proxy, ephemeral orchestrators that reason and self-continue, and atomic workers for search, PDF reading with OCR, and computation — pass state through the database, not memory. It hosts multiple isolated agent identities that can join shared group chats, with anti-loop cooldowns, consensus voting, stall detection, and summary compaction keeping long autonomous reasoning chains stable.
2025 A local-first, reference-crawling RAG system for individual researchers. Give it one seed paper and it recursively follows that paper's citations N levels deep — extracting bibliographic references from PDF text with an LLM, deduplicating them, resolving them through Semantic Scholar, downloading open-access PDFs, and indexing everything into a personal vector knowledge base you can then chat with. Privacy is the point: papers, metadata (SQLite), and the vector store (ChromaDB) all live on your machine, and inference runs locally. A modular design splits an orchestrating Main API from a dedicated vector-store API, served behind a simple browser UI.
2025 An ambitious agentic framework for persistent, evolving, socially-aware AI agents backed by PostgreSQL long-term memory, built on the principle "the model is a commodity; the database is the soul." Agents are transient personas in a tiered hierarchy that inherit and pass down knowledge across generations. Distinctive design ideas include Social Access Control (a four-tier permission model for what agents may share), a dialectical Thesis-Antithesis-Synthesis knowledge engine, and an immutable SQL gateway that intercepts every query to inject soft-delete and ownership filters and block destructive statements. It targets local model inference, a Telegram interface, and a telemetry dashboard. A substantial scaffold exists, but the system is partway through implementation.
2025–2026 An autonomous research agent that runs end-to-end secondary-data studies on public mobile-health datasets without human steering. Each cycle moves through a ten-phase state machine: pick a topic from a graph-backed "second brain," frame a hypothesis from dataset metadata alone, pre-register it before opening a single data row, download from an allow-listed source through a process-level network jail, split a holdout, explore, lock the analysis script, validate on the holdout under hash-gated single-run enforcement, adversarially critique the result, and synthesize a claim-level finding back into the knowledge graph. It is CPU-only, disk-capped, and built so the holdout cannot be peeked.
2025–2026 A mindmap-plus-todo tool built so that an AI agent and a human can edit the same tree. The backend is FastAPI with async SQLAlchemy and PostgreSQL; the web client is React, Vite, and React Flow with automatic layout. Every node is a single checkable item with children, and a parent's state aggregates its descendants as checked, mixed, or unchecked. Its defining choice is a capability-URL model: the mindmap's identifier is itself the credential, so anyone holding the link can read and edit without a token. Agents use the REST API; people use a tokenless editor link. Every change is appended to an audit log.
2025 A conversational research assistant that runs entirely on a local LLM to search, analyze, and synthesize academic papers, exporting results into an Obsidian vault. You ask a research question in natural language; it parses intent, searches PubMed, arXiv, Semantic Scholar, Google Scholar, and local PDFs, snowballs through citations, then runs a multi-stage analysis pipeline that goes well beyond summarization into critical reading, future-work-based direction setting, cross-paper gap identification, and research ideation. Distinctively, it adds a verification layer that checks model output for grounding, self-consistency, and drift, plus a daemon that researches autonomously during idle time.
2025 A tool that turns a hand-drawn sketch or a text description into a publication-quality TikZ/LaTeX figure through an automated critique loop. You hand it a photo of a napkin diagram or a prompt; a model analyzes it into a structured figure specification, generates TikZ code, and the tool compiles it with pdflatex and converts the result to an image. The model then looks at the rendered figure, scores it from one to ten with feedback, and revises until the score clears a threshold or it hits a maximum iteration count. It ships in two forms: a CLI for local runs, and a remote MCP server with OAuth and per-user sessions so it works directly inside Claude Desktop.
2025 An experiment in agent autonomy on bare-metal hardware. The user supplies power, microcontrollers, and a toolchain but imposes no goal; an LLM agent decides what to build and leaves a trail for its future sessions to continue. In its first incarnation on two Arduino Unos, the agent wrote a tiny Forth interpreter that grows a persistent dictionary in EEPROM, and got the two chips to program each other over a hand-rolled 3-wire UART, even running a genetic algorithm that evolved Forth words across the link. Its successor restarts the premise on a Raspberry Pi Pico, where seven debating personas converged on a far stranger design before the first blink.
March 2026 AI Capacity Building for the Health Workforce in Korea: From Medical AI to AI Medicine
Forum on Harnessing AI for Health Equity, Asian Development Bank (ADB), Manila
March 2026 Structural Characteristics of the Korean Health System and Digital Health Innovation
Stanford Graduate School of Business, invited lecture, Seoul
March 2026 AI Basic Health Framework
WHO Department of Digital Health and Innovation, online