i haven't fully dug in yet, i'm imagining that they're doing a fair bit of model merging of MoE experts
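To make "merging MoE experts" a bit more concrete, here's a rough sketch of one way that could work: upcycle several fine-tuned dense checkpoints into a single MoE layer by averaging the shared weights and keeping each checkpoint's FFN as its own expert. The checkpoint structure and the `"attn."`/`"ffn."` key prefixes are assumptions for illustration, not anything confirmed about their setup.

```python
import numpy as np

def merge_into_moe(checkpoints: list[dict[str, np.ndarray]]):
    """Upcycle dense checkpoints into (shared weights, expert bank)."""
    # each checkpoint's FFN weights become one expert
    experts = [{k: v for k, v in ckpt.items() if k.startswith("ffn.")}
               for ckpt in checkpoints]
    # simple uniform average of the remaining (shared) weights, model-soup style
    shared_keys = [k for k in checkpoints[0] if not k.startswith("ffn.")]
    shared = {k: np.mean([ckpt[k] for ckpt in checkpoints], axis=0)
              for k in shared_keys}
    return shared, experts  # experts would then sit behind a (newly trained) router

# e.g. merge_into_moe([math_ckpt, code_ckpt, chat_ckpt])
```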
This seems limited - who has enough data to train a MoE? And even if you merge, I assume there's a limit before you start losing quality?
if each model is effectively a black-box container, it ends up being a self-contained (mini) network API. how outputs get weighted when they conflict would be interesting (probably some level of "trust" measure)
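For the black-box-behind-an-API framing, a minimal sketch of what trust-weighted conflict resolution might look like; the endpoints, response shape, and trust values are all made up:

```python
from collections import defaultdict
import requests

ENDPOINTS = {
    "model_a": "http://model-a.internal/generate",  # hypothetical URLs
    "model_b": "http://model-b.internal/generate",
}
TRUST = {"model_a": 0.7, "model_b": 0.3}  # hypothetical per-model "trust" measure

def query_all(prompt: str) -> str:
    votes = defaultdict(float)
    for name, url in ENDPOINTS.items():
        # each model is just an opaque endpoint; assumes a {"text": ...} response
        answer = requests.post(url, json={"prompt": prompt}, timeout=30).json()["text"]
        votes[answer] += TRUST[name]  # conflicting answers accumulate trust-weighted votes
    return max(votes, key=votes.get)  # highest-trust answer wins
```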
how is it “integrated” though? select the best experts and merge them?
One example is probably something like the Polaris architecture, where multiple domain-specific agents support a primary agent trained for nurse-like conversations
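Roughly, that pattern might look like the sketch below: a primary agent drafts the reply and domain-specific support agents check or enrich it before it goes out. This is only an illustration of the pattern, not Polaris's actual components; the agent names and the `llm(...)` helper are placeholders.

```python
def llm(system: str, user: str) -> str:
    raise NotImplementedError("call whatever model backend you use here")

# hypothetical domain-specific support agents
SUPPORT_AGENTS = {
    "medication": "You check the draft reply for medication-safety issues.",
    "labs":       "You check the draft reply for lab-value interpretation errors.",
}

def respond(patient_msg: str) -> str:
    # primary agent drafts a nurse-like reply
    draft = llm("You are a nurse-like conversational agent.", patient_msg)
    # each specialist reviews the draft, and the primary agent revises accordingly
    for name, system_prompt in SUPPORT_AGENTS.items():
        note = llm(system_prompt, f"Patient: {patient_msg}\nDraft: {draft}")
        draft = llm("Revise the draft using the specialist note.",
                    f"Draft: {draft}\nNote from {name} agent: {note}")
    return draft
```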