Overview
Introduction
MS Concitus provided engineering support for a US healthcare plan operations
management portal. The application supported a network of healthcare agents handling
policy enrolments, changes, terminations, and exceptions across multiple plans and
member populations.
As a Senior Software Engineer, I focused on production reliability: understanding how
agents actually used the system, finding the most painful bugs in live workflows, and
delivering fixes that kept the platform stable while respecting tight regulatory and
contractual constraints.
Background
Context
US healthcare plan operations must comply with strict rules around eligibility,
timelines, and documentation. The portal was at the centre of this:
- Agents used it daily to process member applications and policy changes.
-
Data flowed downstream to billing, claims, and regulatory reporting systems.
-
The platform’s reliability influenced satisfaction of both members and government
partners.
When agents encounter portal errors in the middle of a policy workflow, they often
have to redo work, rely on manual workarounds, or escalate to support — all of which
increase cost and risk for the health plan.
Challenge
Problem
The portal exhibited recurring production issues in critical flows such as:
- New member enrolments and plan selection.
- Policy changes (address, PCP, coverage level) mid-plan year.
- Terminations and reinstatements triggered by life events.
These bugs showed up as:
- Intermittent errors after long agent sessions.
- Incorrect handling of edge cases in policy rules.
- Data inconsistencies between UI and back-office records.
The challenge was to fix these issues without destabilising other parts of a
large, mature codebase and without interrupting daily operations.
Execution
Implementation Highlights
1) Production incident triage
-
Reviewed production logs, error reports, and support tickets to identify patterns in
failing workflows.
-
Worked with business analysts and QA to reconstruct real agent scenarios that
triggered bugs.
-
Prioritised fixes for issues that impacted core policy operations and contract KPIs.
2) Targeted bug fixes in policy workflows
-
Corrected edge-case logic in policy rule evaluations (e.g., special enrolment
periods, age cutoffs, and dependent relationships).
-
Addressed state management issues that caused stale data or duplicate submissions
during long agent sessions.
-
Fixed data mapping problems between UI forms and back-end records to keep member
data consistent.
3) Defensive coding & validation
-
Added input validation and clearer error handling around high-risk operations to
prevent invalid policy states from being saved.
-
Introduced guardrails for common agent mistakes (e.g., incomplete required fields,
conflicting changes) with actionable error messages instead of generic failures.
-
Helped refine automated tests for critical flows so regressions could be caught
earlier.
4) Safer releases in a regulated environment
-
Collaborated with QA and release teams to bundle fixes into controlled releases with
clear rollback plans.
-
Ensured that each change was traceable and well-documented for audits and
compliance reviews.
-
Assisted in monitoring post-release behaviour to confirm incident reduction.
The focus was not on rewriting the system, but on making the existing portal feel
“boringly reliable” to agents: fewer unexpected errors, clearer messages, and more
predictable behaviour in day-to-day policy work.