Modernisation of legacy systems has historically been constrained by three factors: incomplete documentation, inaccessible source code, and shrinking institutional knowledge. Traditional delivery models treat reverse-engineering as a manual, time-intensive discovery phase — often consuming more budget than the rebuild itself.

Recent advances in agentic AI workflows and multimodal large language models introduce a different operating model: one where legacy estates can be analysed, specified, and re-platformed through automated interpretation of code, interfaces, and human expertise.

This article outlines practical patterns for migrating legacy platforms using AI agents — especially in environments where artefacts are partial, opaque, or operationally entangled.


1. Agent-Driven Legacy Code Analysis

Where source code is available, AI agents can perform structured analysis across multiple layers simultaneously.

Static codebase interpretation

Agents can:

  • Parse monolithic repositories
  • Identify bounded contexts
  • Extract domain models
  • Map service dependencies
  • Detect dead code and unused modules

This enables automated production of:

  • Functional specifications
  • Entity diagrams
  • API contracts
  • Data flow maps

Rather than reading code linearly, agents reason across the system holistically — clustering related logic and inferring business purpose.

Database schema reconstruction

Legacy logic often resides in:

  • Stored procedures
  • Triggers
  • Views
  • Batch jobs

Agents can analyse DDL + procedural SQL to derive:

  • Business rules
  • Validation logic
  • Workflow triggers
  • Data lineage

This is particularly valuable in Oracle, DB2, or SQL Server estates where application layers are thin and behaviour is database-centric.


2. UI Reverse Engineering via Screenshot Analysis

In many legacy environments, source code is incomplete, proprietary, or low-code generated. In these cases, the user interface becomes the primary observable artefact.

Multimodal LLMs allow screenshots to be converted into structured system knowledge.

Screenshot → textual system description

By analysing UI captures, agents can extract:

  • Field names and types
  • Input constraints
  • Dropdown value semantics
  • Navigation hierarchies
  • Action buttons and workflows

Example outputs:

  • Screen inventories
  • CRUD operation mapping
  • User journey diagrams
  • Role-based UI variants

This is especially effective in platforms such as:

  • Oracle APEX
  • Lotus Notes
  • PowerBuilder
  • Access applications
  • Proprietary desktop clients

Workflow inference

Sequences of screenshots or recordings enable agents to infer:

  • Process lifecycles
  • Approval chains
  • Escalation paths
  • Exception handling

This transforms visual interaction into executable system specifications.


3. Audio Capture of SME Knowledge

One of the highest-risk aspects of legacy migration is knowledge attrition — particularly when original system builders are unavailable or knowledge is scattered.

AI enables structured capture of tacit knowledge through recorded dialogue.

SME interview ingestion

Subject Matter Experts can provide verbal walkthroughs covering:

  • Business rules
  • Edge cases
  • Operational workarounds
  • Historical design decisions
  • Regulatory constraints

Audio is transcribed using speech-to-text pipelines, then analysed by LLM agents.

From conversation to specification

Agents can convert raw transcripts into:

  • Functional requirements
  • Decision trees
  • Process documentation
  • Data handling rules

They can also reconcile SME explanations against code or UI artefacts, identifying:

  • Undocumented behaviours
  • Policy deviations
  • Operational risks

This closes the institutional knowledge gap without requiring formal documentation exercises.


4. Multimodal Synthesis: Building a Unified System Model

The real leverage emerges when agents correlate multiple artefact types.

Inputs may include:

  • Source code
  • Database schemas
  • UI screenshots
  • Screen recordings
  • SME audio interviews
  • Legacy documentation

Agents can synthesise these into a unified model comprising:

  • Domain entities
  • Service boundaries
  • Workflow orchestration
  • Integration contracts
  • Non-functional constraints

This becomes the foundation for re-platforming.


5. Automated Specification Generation

Once analysed, agents can produce structured artefacts required for modern delivery pipelines:

  • OpenAPI specifications
  • Event schemas
  • Entity-relationship diagrams
  • Infrastructure requirements
  • Security classification models

Specifications can be exported into formats consumable by engineering teams or downstream AI builders.

This eliminates months of traditional discovery workshops.


6. Code Generation and Re-Platforming

With specifications in place, agent workflows can assist in rebuilding target systems.

Typical transformations include:

Legacy Source Target Platform
PL/SQL logicJava / Spring services
Forms UIReact / Angular
Batch jobsEvent-driven workers
Monolith DBService-scoped schemas

Agents can generate:

  • Service scaffolding
  • Data access layers
  • Validation rules
  • Migration scripts

Human engineers remain responsible for architecture and quality control, but delivery velocity increases materially.


7. Test Case Derivation

Legacy systems often lack formal test coverage.

Agents can derive tests from:

  • Validation logic
  • Historical defects
  • UI constraints
  • SME narratives

Outputs include:

  • Unit test suites
  • Integration scenarios
  • Regression packs
  • Synthetic data sets

This ensures behavioural parity between legacy and modernised platforms.


8. Documentation Regeneration

Documentation gaps are endemic in legacy estates.

AI agents can automatically produce:

  • System overviews
  • Runbooks
  • Support guides
  • API usage documentation
  • Operational procedures

This shifts documentation from an afterthought to a generated asset of the reverse-engineering process.


9. Additional High-Value Use Cases

Beyond direct migration, multimodal legacy analysis enables:

Compliance auditing

Agents can detect:

  • Data retention violations
  • Access control weaknesses
  • Encryption gaps

Licensing risk analysis

Identifying dependencies on:

  • Oracle
  • IBM
  • Proprietary runtimes

Supporting cost-reduction strategies.

Performance bottleneck discovery

Analysing:

  • Query inefficiencies
  • Batch processing delays
  • Concurrency conflicts

Integration mapping

Documenting undocumented interfaces between:

  • Internal systems
  • Third-party vendors
  • Government platforms

10. Delivery Model Transformation

Traditional legacy replacement programmes follow a sequence:

  1. Discovery
  2. Documentation
  3. Design
  4. Build
  5. Test

Agentic workflows compress phases 1–3 into parallel, automated processes.

This yields:

  • Shorter mobilisation timelines
  • Reduced consultancy cost
  • Faster architectural convergence

In some cases, single-team or even single-engineer rebuild spikes become viable where large programmes were previously assumed necessary.


Conclusion

Legacy migration is no longer constrained to code translation. With multimodal AI agents, organisations can reverse-engineer entire systems from fragmented artefacts — combining code analysis, UI interpretation, and institutional knowledge capture.

Screenshots become specifications. Conversations become requirements. Database triggers become domain rules.

The result is a materially accelerated path from opaque legacy estates to modern, maintainable platforms — with reduced delivery cost and lower operational risk.

As agent capabilities mature, this approach is likely to evolve from experimental practice into a standard operating model for public sector and enterprise re-platforming initiatives.