
Automated TMF Data Mapping: 1,300+ Tables in 20 Minutes
Built AI-powered ETL platform that maps 1,300+ legacy telco tables to TMF standard objects with 10-20 minutes of processing time per object mapping, eliminating months of manual work by cross-functional teams.
Achieved 60% mapping quality improvement by combining deterministic evaluation with LLM intelligence, proving agents alone aren't enough for enterprise accuracy.
Delivered immediate customer retention value by enabling rapid BSS application development without replacing existing Salesforce infrastructure.
Telecommunications companies sit on decades of legacy data trapped in systems like Salesforce, SAP, and custom databases. Mapping this data to industry standards like TMF (TeleManagement Forum—the global telecommunications standards body that defines data models and APIs for BSS/OSS systems) takes months or years of manual work across thousands of tables and typically requires cross-functional teams of domain experts, data engineers, and business analysts. This blocks AI adoption, system modernization, and competitive BSS application development.
Telecommunications companies operate on infrastructure built over decades. A typical telco runs Salesforce with 1,300+ custom tables, SAP systems with proprietary schemas, and homegrown databases with zero documentation.
Modernizing these systems requires mapping every field to industry standards like TMF (TeleManagement Forum). This process traditionally takes 6-18 months of manual work by domain experts who understand both the legacy schema and the target standard.
The business impact is severe. CloudSense customers were considering churn because manual integration work was too slow and expensive. Without automated mapping, telcos can't build AI-driven BSS applications, can't migrate to modern platforms, and can't compete with cloud-native competitors.
TelcoDR needed a solution that could:
- Automatically discover and map legacy schemas with secure credential management - Handle 1,300+ tables with complex relationships and custom fields - Map to TMF standards with enterprise-grade accuracy - Work on-premise to maintain data sovereignty - Scale to evolving telecommunications standards
"...the problem we're trying to solve is this horrible mess of complexity and legacy, and we've just found the holy grail of potentially cracking that problem.... ...That's really, really awesome."
Nigel Back, Head of Product, TelcoDR
Using PostgreSQL Foreign Data Wrapper (FDW), we enabled SQL queries against Salesforce and other external systems. The FDW connects through system APIs (using securely managed credentials for each data source), presenting external data as queryable PostgreSQL tables within the platform.
This approach provides a unified query interface across disparate systems. Telcos can run the platform on-premise using Docker with AWS IoT connectivity, processing data locally while leveraging cloud AI capabilities for mapping intelligence.
We integrated AWS Bedrock using Claude 4.5 Sonnet for intelligent schema interpretation and mapping suggestions. The platform architecture supports Anthropic's latest models as they become available, ensuring access to state-of-the-art capabilities as the model family evolves. The AWS infrastructure provides enterprise compliance through built-in security controls, encryption, and audit logging.
The AI analyzes database schemas, identifies relationships between tables, and proposes mappings to TMF standards based on semantic understanding of telecommunications concepts and naming patterns.
Early versions relied on LLM agents to generate and validate mappings. Quality was inconsistent. We rebuilt the system using Langfuse evaluation pipelines with deterministic checks.
Now, agents propose mappings, but deterministic rules validate them against TMF specifications, data type compatibility, and relationship constraints. This hybrid approach improved mapping accuracy by 60% and made the system debuggable and auditable, critical for regulated telco environments.
The biggest technical lesson: LLM agents are powerful but insufficient for enterprise accuracy.
Pure agent-based approaches let the model generate and validate its own work. This creates inconsistency, hallucinations, and errors that compound across 1,300+ tables. When an agent maps a field incorrectly, downstream relationships inherit the error.
Deterministic evaluation separates generation from validation. We apply AI where it provides clear value: semantic understanding of legacy schemas and intelligent mapping suggestions. The deterministic validation layer then ensures accuracy.
The platform runs on AWS using infrastructure as code (AWS CDK) with full CI/CD automation. ECS Fargate handles container orchestration with auto-scaling based on mapping workload. Aurora PostgreSQL provides the managed database layer with tenant-column-based multi-tenancy for data isolation required for telecommunications compliance.
For customers requiring data sovereignty, we deploy a lightweight Docker runtime on-premise. It connects to AWS IoT Core for secure communication with the cloud control plane. The on-premise runtime executes queries via FDW, sends schema metadata to cloud AI services for mapping suggestions, and applies results locally.
Beyond data mapping, the platform enables telcos to build modern BSS applications on top of legacy systems without migration risk.
Once legacy data maps to TMF standards, developers can build new applications against the standard API. The platform handles translation between TMF and the underlying Salesforce or SAP schema in real time.
Telcos get modern capabilities without replacing systems that represent decades of business logic. The legacy infrastructure stays in place. New applications interact through standardized TMF interfaces.
Customer churn prediction using unified data across legacy systems
Automated billing reconciliation across multiple source systems
Real-time service provisioning with legacy system integration
Compliance reporting that aggregates data from incompatible schemas
The platform runs on AWS using infrastructure as code (AWS CDK) with full CI/CD automation. ECS Fargate handles container orchestration with auto-scaling based on mapping workload. Aurora PostgreSQL provides the managed database layer with tenant-column-based multi-tenancy for data isolation required for telecommunications compliance. For customers requiring data sovereignty, we deploy a lightweight Docker runtime on-premise. It connects to AWS IoT Core for secure communication with the cloud control plane. The on-premise runtime executes queries via FDW, sends schema metadata to cloud AI services for mapping suggestions, and applies results locally. While certain metadata does flow to AWS for processing, the architecture minimizes data exposure and satisfies telco security requirements while delivering cloud AI capabilities. AWS CDK provisions complete test environments on every pull request. Automated tests validate general application management, infrastructure provisioning, and system integration against real Salesforce sandboxes with 1,300+ table schemas. Environments tear down automatically after testing. This infrastructure enabled rapid iteration during development, establishing clear deployment pipelines and measurable progress metrics.
10-20 minutes per TMF object mapping (vs 6-18 months manual)
1,300+ source tables mapped to 67+ TMF objects
60% improvement in mapping accuracy
Immediate customer retention value for CloudSense
Production-quality mapping code without rework
Zero downtime during mapping operations
The platform processes TMF object mappings in 10-20 minutes each, handling 1,300+ source tables mapped to 67+ TMF standard objects. Manual approaches required cross-functional teams working for 6-18 months for equivalent work.
Mapping quality improved 60% after transitioning from pure agent-based to deterministic evaluation. The system now catches data type mismatches, relationship errors, and TMF specification violations that agents alone missed.
TelcoDR demonstrated the platform to CloudSense customers considering churn. The ability to rapidly map legacy Salesforce data to TMF standards delivered immediate retention value. Customers who would have left due to slow manual integration work now see a path to modernization without system replacement.
The platform proves the approach works at telecommunications scale. During live demonstrations, we showed the system's ability to generate production-quality mapping code that would take developers months to write manually, with the critical advantage that the generated code is immediately usable rather than requiring extensive rework.
Deterministic evaluation is mandatory for enterprise AI accuracy. Pure agent workflows produce inconsistent results at scale. Separate generation from validation with structured checks against specifications and business rules.
Foreign Data Wrappers enable unified query interfaces across systems. FDW presents external APIs as SQL-queryable tables, providing consistent data access patterns across disparate source systems.
AWS Bedrock integration provides enterprise-grade AI capabilities. Cloud infrastructure delivers advanced LLMs with built-in security, compliance, and audit controls essential for telecommunications.
Hybrid cloud/on-premise architectures satisfy data sovereignty requirements. Deploy lightweight runtimes that process data locally while leveraging cloud AI services through secure connectivity.
Infrastructure as code with automated testing enables rapid iteration. AWS CDK provisioning of full test environments on every PR improves development velocity and system reliability.
Legacy system integration beats replacement for regulated industries. Mapping existing data to standards unlocks modern capabilities without migration risk or loss of decades of embedded business logic.
TMF standards enable interoperability across telco systems. Once legacy data maps to TMF, new applications work across any compliant system, reducing vendor lock-in and accelerating BSS development.
TelcoDR and AE Studio proved that the 'holy grail of ETL' is achievable with the right combination of AI and deterministic validation. What used to require cross-functional teams working for 6-18 months now processes in 10-20 minutes per TMF object mapping with 60% better accuracy.
The platform delivers immediate business value through customer retention while enabling long-term strategic value through rapid BSS application development on legacy infrastructure. Telecommunications companies can now modernize without the risk and cost of system replacement.
As telecommunications standards evolve and AI capabilities advance, the architecture scales to new target standards and data sources. The deterministic evaluation approach ensures quality and auditability will remain enterprise-grade as the platform grows.
Last updated: Jan 2026
Let's discuss how we can help transform your ideas into reality.