Jennifer Sikos PhD

Senior Engineering Manager

Jen leads the Platform, ML, and MLOps teams at Legion Intelligence. Her work centers on building scalable, production-grade AI systems and advancing evaluation frameworks for agentic workflows. Her background spans academic research and applied engineering in Machine Learning and Natural Language Processing. Prior to Legion, she served as Director of Engineering at Textio, where she led the generative AI transformation of the company's core product and built LLM evaluation pipelines and model quality metrics at production scale. Earlier in her career, Jen conducted NLP research for defense and intelligence clients, serving as Principal Investigator on multiple SBIR contracts.

10 minute read

AI Agent Evaluation: Building an Evaluation Platform That Scales

Teams deploy agents faster than they can test them. A single prompt change can silently degrade three agents while improving one. Here's how immutable datasets, purpose-driven metrics, and vendor-agnostic design make agentic evaluation reproducible at scale.

Jennifer Sikos PhD

https://legionintel.com/team/
