OSMP (Octid Semantic Mesh Protocol)

OSMP (Octid Semantic Mesh Protocol)

Agentic AI instruction encoding. 86.8% smaller than JSON. Inference-free decode by table lookup. 342 opcodes, 26 namespaces, three conformant SDKs. Confirmed over LoRa mesh radio. Apache 2.0 with express patent grant.

Category
访问服务器

README

Cloudless Sky

Agentic AI mesh without the cloud.

OSMP (Octid Semantic Mesh Protocol) is an open encoding standard for agentic AI instruction exchange. It works across any channel — from a 51-byte LoRa radio packet to a high-throughput cloud inference pipeline — using the same grammar, the same dictionary, and the same decode logic.

No cloud required. No inference at the decode layer. No central authority.

<p align="center"> <img src="assets/octid-openclaw.png" alt="Octid and the OpenClaw community" width="480"> </p>


The Problem

When AI agents communicate in JSON over HTTP, the cost compounds at every hop.

{"action": "move", "agent": "BOT1", "waypoint": "WP1", "priority": "urgent"}

82 bytes of JSON envelope before you put any content in. Requires tokenization. Requires inference to parse. Fails completely at the 51-byte LoRa minimum payload.

The Solution

R:MOV@BOT1:WPT:WP1↺

21 bytes. Decode is a table lookup. Fits a single LoRa packet at maximum-range spreading factor. No inference required at the receiving node. The LLM's existing NL→structured output capability handles the translation. The system prompt supplies the SAL grammar, the ASD, and examples from the canonical test vectors. No new tooling required on the generating side. What OSMP changes is the output format, and the decode layer: the receiving node does a table lookup, not inference.


Integration Path

OSMP replaces JSON as the agent instruction serialization format. An LLM that currently outputs:

"If heart rate at node 1 exceeds 120, assemble casualty report
 and broadcast evacuation to all nodes."

→ 100 bytes. Requires tokenization. Requires inference to parse.
  Fails completely at 51-byte LoRa minimum payload.

can be configured via system prompt to output:

H:HR@NODE1>120→H:CASREP∧M:EVA@*

→ 35 bytes. 65% reduction. Decode is a table lookup.
  Fits a single LoRa packet at maximum-range spreading factor.
  No inference required at the receiving node.

Measured Performance

<table> <tr> <td align="center"><h1>86.8%</h1><b>byte reduction vs JSON</b><br>29 real-world vectors from 5 frameworks</td> <td align="center"><h1>70.5%</h1><b>byte reduction vs protobuf</b><br>compiled schemas, protoc 3.21.12</td> <td align="center"><h1>76.0%</h1><b>fewer tokens (GPT-4)</b><br>cl100k_base tokenizer, 1,809 → 434</td> </tr> </table>

Compression claims are measured, not estimated. The 29-vector SAL vs JSON benchmark uses real wire-format payloads from MCP, OpenAI, Google A2A, CrewAI, and AutoGen. Full methodology and adversarial review in the whitepaper.


Quick Start

MCP Server (any AI client)

pip install osmp-mcp
osmp-mcp

Installs the MCP server with three domain corpora (ICD-10-CM, ISO 20022, MITRE ATT&CK) included. Connect from Claude Code (claude mcp add osmp -- osmp-mcp), Claude Desktop, Cursor, or any MCP-compatible client. Listed on the MCP Registry as io.github.Octid-io/osmp.

From source

git clone https://github.com/octid-io/cloudless-sky
cd cloudless-sky
python3 sdk/python/src/osmp.py

No dependencies beyond Python standard library.

npm

npm install osmp-protocol

Go

import "github.com/octid-io/cloudless-sky/sdk/go/osmp"

Why OSMP Is Different From Every Other Agent Protocol

Protocol Transport Offline Compression Inference-Free Decode
MCP (Anthropic) HTTP/JSON
A2A (Google/Linux Foundation) HTTPS/JSON
ACP (IBM) REST/HTTP
OSMP Any channel 86.8% vs JSON

MCP, A2A, and ACP are framework-layer protocols. OSMP is an encoding-layer protocol. It operates beneath any of them. Two agents using different frameworks that share the OSMP grammar and dictionary can communicate with no modification to either framework.


SDK Status

All three SDKs are independently verified against the canonical test suite. Wire compatibility is confirmed: Python, TypeScript, and Go produce field-for-field identical decode results across every namespace, every operator, and every edge case documented in the spec. D:PACK/BLK resolve is verified across all 124,215 domain codes (74,719 ICD-10-CM + 47,835 ISO 20022 + 1,661 MITRE ATT&CK) in all three SDKs.

SDK Target Conformance Notes
Python Reference implementation CONFORMANT Single source of truth for all SDK behavior
TypeScript OpenClaw / web agent integrations CONFORMANT fzstd (82KB, pure JS, zero native deps) for D:PACK/BLK
Go PicoClaw / constrained hardware CONFORMANT ASD compiled-in; D:PACK/BLK via klauspost/compress/zstd (3.1MB binary)
MCP Server Any MCP-compatible AI client pip install osmp-mcp 9 tools: encode, decode, compound_decode, lookup, validate, discover, resolve, batch_resolve, benchmark

Benchmark

$ python3 sdk/python/src/osmp.py

OSMP BENCHMARK — Cloudless Sky Protocol v1.0
SDK: Python (reference)

  ID         NL Bytes OSMP Bytes  Reduction
  ✓ TV-001         43         10      76.7%
  ✓ TV-013        100         35      65.0%
  ✓ TV-015        101         30      70.3%
  ...

  Mean reduction: 60.8%
  CONFORMANT ✓  (mean 60.8% vs 60.0% threshold)

Run it yourself. The numbers are real and independently reproducible across all three SDKs.


What OSMP Delivers Today

Everything here is operational from the floor ASD without MDR, cloud access, or additional tooling.

Instruction encoding across all 26 standard namespaces — 342 opcodes drawn from authoritative sources: IEC 61850 (energy), ICD-10/SNOMED CT/CPT (clinical), ISO 20022/FIX/SWIFT (financial), ISO 10218-1:2025 (robotics), FEMA ICS/NIMS (emergency management), BDI/PDDL/HTN (cognitive AI), OpenAI/Anthropic APIs (model operations).

Four AI-native namespaces — J (Cognitive Execution State), Q (Quality/Evaluation/Grounding), Y (Memory + Retrieval), Z (Model/Inference Operations). No prior agent communication protocol defines these. They encode what agents do internally, not just what they communicate between themselves. The J→Y→Z→Q chain encodes the full AI cognitive pipeline as a single transmissible SAL instruction sequence, decodable by ASD lookup without neural inference.

Two-tier corpus compression -- D:PACK applies OSMP encoding followed by lossless dictionary-based compression for at-rest storage. Two profiles: D:PACK/LZMA (full-corpus, companion device) and D:PACK/BLK (zstd block-level, microcontroller target with single-code random access in 38KB SRAM). D:UNPACK retrieves semantic content by ASD lookup against the SAL intermediate representation. Three complete D:PACK/BLK dict-free builds demonstrate the architecture at scale: the CMS FY2026 ICD-10-CM code set (74,719 clinical descriptions, 5.4MB raw) produces a 477KB binary (91.4% total reduction), the ISO 20022 eRepository (47,835 unique financial definitions extracted from 66,956 source rows, 8.7MB raw) produces a 1.2MB binary (86.5% total pipeline reduction), and the MITRE ATT&CK Enterprise v18.1 knowledge base (1,661 techniques, malware, tools, and threat groups, 82KB raw) produces a 20KB binary (75.3% reduction). All artifacts fit in ESP32 flash and enable edge-local domain code resolution without network access.

Layer 2 accessors — Bracket-enclosed slot values drawn from external open-ended registries: H:ICD[R00.1], H:SNOMED[concept_id], H:CPT[99213], K:ISO[MessageDefinitionIdentifier]. Layer 2 slot values are exempt from the single-character encoding rule and are fully functional today with native code values in brackets. MDR corpus compression increases storage density of these codes; it does not gate the accessor pattern itself.

Three MDR domain corpora shipped — ICD-10-CM (74,719 clinical codes, H namespace), ISO 20022 (47,835 financial definitions, K namespace), and MITRE ATT&CK Enterprise v18.1 (1,661 entries, S namespace). All three are D:PACK/BLK dict-free binaries resolvable by all three SDKs without network access. See D:PACK section above for sizes and reduction figures.

R:ESTOP hard exception — executes immediately on receipt of any single fragment, regardless of loss tolerance policy, fragment completeness, or I:§ authorization state. Asymmetric harm: unnecessary stop is recoverable; failure to stop a physical agent in emergency is not. This is intentional and documented in spec §8.2. Do not modify.

Overflow Protocol — Tier 1 (single packet, ≤51 bytes at LoRa SF12), Tier 2 (sequential burst), and Tier 3 (DAG decomposition for conditional branches and dependency chains). Three loss tolerance policies: Φ (Fail-Safe), Γ (Graceful Degradation, default), Λ (Atomic -- required for K and H namespace instructions with irreversible consequences). Tier 3 decomposes compound SAL instructions into a directed acyclic graph of executable units with dependency pointers, resolves execution order via topological sort, and applies loss tolerance to the maximal resolvable subgraph under partial receipt. Multi-parent dependencies (diamond joins) use a FLAGS bit 3 extended dependency bitmap. Fragment header stays at 6 bytes across all three tiers.

BAEL floor guarantee — the protocol never makes an instruction longer than its natural language input. When the encoded form exceeds the natural language form, BAEL selects NL_PASSTHROUGH and transmits the original with a flags bit. Compression is never negative.

FNP handshake — Two-message capability advertisement + acknowledgment (40B + 38B = 78 bytes total). Negotiates dictionary alignment, namespace intersection, and channel capacity in two LoRa packets. Implemented in all three SDKs with byte-identical wire format. Channel capacity negotiation selects the LCD of both nodes, so the mesh scales within the most constrained link.

ADP dictionary synchronization — The ASD Distribution Protocol keeps dictionaries aligned across nodes after initial FNP handshake. Delta-based updates decompose dictionary changes into independently parseable units, each carrying a version pointer and a tripartite resolution flag (additive, superseding replacement with mandatory retransmission, or deprecation). Nodes apply deltas as they arrive and operate in a partially updated but internally consistent state during synchronization. Instructions referencing opcodes whose defining delta has not yet arrived are held in a semantic pending queue and resolved on receipt. MAJOR.MINOR version signaling detects breaking changes. The guaranteed minimum operational vocabulary floor ensures every node can decode baseline instructions regardless of synchronization state. Implemented in the Python SDK with 69 tests passing.

Sovereign namespace extensionΩ: (U+03A9) allows any implementing party to define proprietary namespace extensions without central approval or registration.


What Requires Future Work

C++ firmware-level OSMP nodes — OSMP integration with Meshtastic via the Python SDK and Meshtastic Python library is operational today (see CONTRIBUTING.md). The C++ contribution target is a firmware-level encoder/decoder enabling ESP32 and nRF52 Meshtastic devices to operate as sovereign OSMP nodes without a companion device, with the ASD compiled into flash.

Additional MDR namespaces — SNOMED CT, RxNorm, LOINC, and other open registries are future namespace targets.


Architecture

Component Function
SAL — Semantic Assembly Language Domain-specific symbolic instruction format
ASD — Adaptive Shared Dictionary 342-opcode version-pinned compression dictionary
FNP — Frame Negotiation Protocol Capability negotiation and session handshake
ADP — ASD Distribution Protocol Dictionary delta synchronization across nodes
SNA — Sovereign Node Architecture Autonomous edge node, air-gapped operation
TCL — Translational Compression Layer Semantic serialization and transcoding
OP — Overflow Protocol Message fragmentation, priority, graceful degradation
BAEL — Bandwidth-Agnostic Efficiency Layer Adaptive encoding across any channel capacity

Namespaces

A  Agentic/OSMP-Native     M  Municipal Operations
B  Building/Construction   N  Network/Routing
C  Compute/Resource Mgmt   O  Operational Context/Environment
D  Data/Query/File Transfer P  Procedural/Maintenance
E  Environmental/Sensor    Q  Quality/Evaluation/Grounding  ← AI-native
F  Federal/Regulatory      R  Robotic/Physical Agent
G  Geospatial/Navigation   S  Security/Cryptographic
H  Health/Clinical         T  Time/Scheduling
I  Identity/Permissioning  U  User/Human Interaction
J  Cognitive Exec State ← AI-native  V  Vehicle/Transport Fleet
K  Financial/Transaction   W  Weather/External Environment
L  Logging/Audit/Compliance X  Energy/Power Systems
                            Y  Memory + Retrieval     ← AI-native
                            Z  Model/Inference Ops    ← AI-native
                           Ω:  Sovereign Extension

Example Instructions

# Environmental query
EQ@4A?TH:0
→ "Node 4A, report temperature at offset zero."  76.7% reduction

# Emergency broadcast
MA@*!EVA
→ "Broadcast evacuation to all nodes."  81.8% reduction

# MEDEVAC threshold alert
H:HR@NODE1>120→H:CASREP∧M:EVA@*
→ "If heart rate exceeds 120, assemble CASREP and broadcast evacuation."  65.0% reduction

# Clinical with ICD-10 Layer 2 accessor (functional today)
H:HR<60→H:ALERT[BRADYCARDIA]∧H:ICD[R00.1]
→ "If heart rate below 60, alert bradycardia with ICD-10 code."

# Two-tier corpus encoding
D:PACK@CORPUS∧D:UNPACK[query]
→ Encode corpus for at-rest storage; retrieve by ASD lookup without decompression.

# Atomic financial instruction
K:PAY@RECV↔I:§→K:XFR[AMT]
→ "Execute payment iff human confirmation received, then transfer asset."  70.3% reduction

# Internet-uplink capability-addressed routing
∃N:INET→A:DA@RELAY1
→ "Route to any node with internet uplink, delegate to relay."

# AI cognitive pipeline
J:GOAL∧Y:SEARCH∧Z:INF∧Q:GROUND
→ "Declare goal, retrieve from memory, invoke inference, verify grounding."

Repository Structure

cloudless-sky/
  protocol/
    spec/           <- OSMP-SPEC-v1.md -- authoritative protocol specification
    grammar/        <- SAL-grammar.ebnf -- formal grammar (EBNF)
    test-vectors/   <- canonical-test-vectors.json -- conformance suite
  sdk/
    python/         <- Reference implementation
    typescript/     <- OpenClaw/web SDK (fzstd for D:PACK/BLK)
    go/             <- PicoClaw/constrained hardware SDK
  mcp/
    server.py       <- MCP server (pip install osmp-mcp)
    server.json     <- MCP Registry descriptor
  mdr/
    icd10cm/        <- CMS FY2026 ICD-10-CM (74,719 codes, 477KB)
    iso20022/       <- ISO 20022 eRepository (47,835 elements, 1.2MB)
    mitre-attack/   <- MITRE ATT&CK Enterprise v18.1 (1,661 entries, 20KB)
  benchmarks/
    sal-vs-json/    <- 29-vector framework benchmark, four-way comparison, grammar analysis
  docs/
    SAL-efficiency-analysis.md  <- Whitepaper v2.0: structural efficiency analysis
    adr/            <- Architecture Decision Records
  tests/
    tier1/          <- Unit tests per SDK + D:PACK/BLK resolve tests
    tier2/          <- Cross-SDK wire compatibility
    tier3/          <- Tier 3 DAG decomposition tests (Python, TypeScript, Go)

Contributing

See CONTRIBUTING.md. The spec is authoritative. All SDK implementations are validated against the canonical test vectors. A conformant implementation must achieve ≥60% mean UTF-8 byte reduction with zero decode errors.

Meshtastic integration via the Python SDK and Meshtastic Python library is operational today with no additional code required. See CONTRIBUTING.md for details.

Wanted: C++ firmware-level encoder/decoder (ESP32/nRF52 sovereign nodes), Kotlin/Swift mobile SDKs.


Patent Notice

The OSMP architecture is covered by pending US patent application OSMP-001-UTIL (inventor: Clay Holberg, priority date March 17, 2026). A continuation-in-part (OSMP-001-CIP) extends coverage to cloud-scale AI orchestration, non-RF channels, and the AI-native namespace architecture. Apache 2.0 includes an express patent grant for implementations of this specification. See PATENT-NOTICE.md.


License

Apache 2.0 — see LICENSE.


Cloudless Sky is a project of Octid.

推荐服务器

Baidu Map

Baidu Map

百度地图核心API现已全面兼容MCP协议,是国内首家兼容MCP协议的地图服务商。

官方
精选
JavaScript
Playwright MCP Server

Playwright MCP Server

一个模型上下文协议服务器,它使大型语言模型能够通过结构化的可访问性快照与网页进行交互,而无需视觉模型或屏幕截图。

官方
精选
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

一个由人工智能驱动的工具,可以从自然语言描述生成现代化的用户界面组件,并与流行的集成开发环境(IDE)集成,从而简化用户界面开发流程。

官方
精选
本地
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

通过模型上下文协议启用与 Audiense Insights 账户的交互,从而促进营销洞察和受众数据的提取和分析,包括人口统计信息、行为和影响者互动。

官方
精选
本地
TypeScript
VeyraX

VeyraX

一个单一的 MCP 工具,连接你所有喜爱的工具:Gmail、日历以及其他 40 多个工具。

官方
精选
本地
graphlit-mcp-server

graphlit-mcp-server

模型上下文协议 (MCP) 服务器实现了 MCP 客户端与 Graphlit 服务之间的集成。 除了网络爬取之外,还可以将任何内容(从 Slack 到 Gmail 再到播客订阅源)导入到 Graphlit 项目中,然后从 MCP 客户端检索相关内容。

官方
精选
TypeScript
Kagi MCP Server

Kagi MCP Server

一个 MCP 服务器,集成了 Kagi 搜索功能和 Claude AI,使 Claude 能够在回答需要最新信息的问题时执行实时网络搜索。

官方
精选
Python
e2b-mcp-server

e2b-mcp-server

使用 MCP 通过 e2b 运行代码。

官方
精选
Neon MCP Server

Neon MCP Server

用于与 Neon 管理 API 和数据库交互的 MCP 服务器

官方
精选
Exa MCP Server

Exa MCP Server

模型上下文协议(MCP)服务器允许像 Claude 这样的 AI 助手使用 Exa AI 搜索 API 进行网络搜索。这种设置允许 AI 模型以安全和受控的方式获取实时的网络信息。

官方
精选