19% of Exam~14 questions

Ingestion

The Ingestion domain covers how data flows into the CMDB from external sources. It covers three primary mechanisms: traditional Import Sets/Transform Maps, modern Service Graph Connectors, and the underlying Identification and Reconciliation Engine (IRE). Mastering these concepts is critical for data quality, consistency, and CSDM alignment.

IRE (Identification and Reconciliation Engine)

The IRE is ServiceNow's core framework for processing all incoming CI data. Every CI that enters the CMDB — whether from Discovery, Import Sets, Service Graph Connectors, or direct API calls — passes through the IRE.

Primary Functions

  • CI Identification — matching incoming data to existing CIs in the CMDB
  • Reconciliation — resolving conflicts when multiple sources provide different values for the same attribute
  • Relationship Processing — creating and maintaining CI-to-CI relationships
  • Data Source Prioritization — determining which source is authoritative for each attribute

8-Step IRE Processing Flow

  1. Payload Received — data arrives from Discovery, Import Set, SGC, or API
  2. Normalization — Normalization Data Services (NDS) standardizes values (e.g., manufacturer names, OS names)
  3. Identification Rules Evaluated — rules are checked in priority order; first match wins and IRE STOPS at the first match
  4. Match Decision — single match means update existing CI; no match means create a new CI
  5. Multiple Match = De-duplication Task created — NO update occurs and processing halts for that CI
  6. Reconciliation — if updating, source precedence determines which values win per attribute
  7. CMDB Update — CI is created or updated in the CMDB
  8. Audit Logging — all changes are recorded in the CI's history
🔴First Match Wins

The IRE stops at the FIRST identification rule match. It does NOT continue evaluating lower-priority rules. This is a frequently tested concept on the exam.

Identification Rule Components

  • CI Class — the CMDB class the rule applies to
  • Identification Attributes — the field combinations used for matching
  • Order — determines evaluation sequence (first match wins)
  • Conditions — optional filters to narrow when the rule applies

Example: Computer Identification

For computers, a common identification rule setup is: Rule Order 1 uses serial_number AND manufacturer with the condition that serial_number IS NOT EMPTY. This ensures strong, reliable matching before falling back to weaker identifiers.

Identification Rules

Identification rules are defined per CI class and evaluated in priority order. They determine how incoming data is matched to existing CIs in the CMDB. Proper configuration of identification rules is essential for preventing duplicate CIs and ensuring data integrity.

Identifier Entries

Each identification rule contains identifier entries — attribute combinations with assigned priorities. A lower priority number means higher evaluation priority (checked first).

Independent vs Dependent CIs

  • Independent CIs — can be identified on their own using their attributes alone (e.g., servers, network devices)
  • Dependent CIs — require the parent CI to be matched first before identification can proceed (e.g., network interface cards, disk drives)

Inclusion Rules

Optional filters that narrow the set of candidate CIs considered during identification. They help reduce false matches by limiting the search scope.

Example Identification Rule Priorities

PriorityAttributesUse Case
100serial_number + serial_number_typeHardware CIs — strongest identifier
200fqdn + mac_addressNetwork devices — reliable combination
300nameFallback — avoid if possible due to non-uniqueness
💡Use Strong Identifiers

Always prefer strong identifiers like hardware serial numbers over weak identifiers like names. Names are not unique and lead to false matches and de-duplication tasks.

Testing and Supported Sources

  • Test identification rules using the CMDBTransformUtil API before deploying to production
  • Supported sources: Discovery, Service Mapping, IntegrationHub ETL, IRE APIs
📝IRE Is Mandatory

The IRE is mandatory for all IRE-enabled sources. It ensures CMDB accuracy by enforcing consistent identification and reconciliation across all data inputs.

Reconciliation and Data Source Precedence

Reconciliation occurs when multiple data sources provide information about the same CI. The reconciliation engine determines which source's value wins for each individual attribute based on data source priority.

Priority Rules

A lower priority number indicates higher authority. For example, a source with priority 100 beats a source with priority 200. This is per official ServiceNow documentation.

🔴Per-Attribute, Not Per-CI

Reconciliation is per-attribute, not per-CI. Different fields on the same CI can come from different winning sources. For example, Discovery might win for RAM while SCCM wins for installed software.

Reconciliation Scenarios

ScenarioResult
Single source, no existing CICreates new CI
Single source, matching existing CIUpdates existing CI
Multiple sources, no conflictsUpdates from all sources
Multiple sources with conflictsAuthoritative source (lowest priority number) wins
Discovery vs Manual entryTypically Discovery wins (lower priority number)

Data Refresh Rules

When a higher-priority source stops updating, after a configurable period (e.g., 14 days), the next-priority source is allowed to update those attributes. This prevents stale data from blocking updates from active sources.

Non-Reconcileable Fields

Certain fields like Comments or Assigned To can be configured to never be overwritten by automated sources. This protects manually curated data from being overridden by Discovery or integrations.

CMDB 360

CMDB 360 stores ALL values from all sources in the cmdb_multisource_data table. Even when reconciliation picks a winner, the losing values are preserved. Administrators can compare values from different sources and manually override if needed.

sys_object_source Table

The sys_object_source table maps external native keys to ServiceNow sys_ids. This is critical for tracking which external system created or owns each CI, and it enables the IRE to maintain the link between external records and CMDB CIs.

AD SPACE

Import Sets and Transform Maps

Import Sets are temporary staging tables for external data before it is transformed and loaded into the CMDB. They serve as a buffer that allows data to be validated, mapped, and cleaned before it becomes part of the production CMDB.

Key Characteristics

  • Support CSV, Excel, XML, and JSON formats
  • Import Set Tables are created automatically based on the source data structure
  • Data is temporary — automatically purged after successful transformation

Transform Maps

Transform Maps define the mapping rules between source (import set) columns and target (CMDB) fields. They control how raw data becomes structured CI records.

Transform Map Components

  • Source Table — the import set table containing raw data
  • Target Table — the CMDB table where data will be written
  • Field Maps — individual column-to-field mappings
  • Transform Scripts — custom logic executed during transformation

Transform Script Execution Order

PhaseScript TypePurpose
1onBeforeRuns before any field mapping — used for data validation
2Field mapsPer-field transformation and mapping
3onAfterRuns after all mapping — used for relationship creation
4script (field map level)Alternative to direct field map — conditional logic per field

Coalesce Fields

The coalesce field determines which source field(s) are used to identify existing records for updates vs inserts. When a coalesce match is found, the existing record is updated. When no match is found, a new record is inserted.

⚠️Key Exam Pitfalls

Watch for these common traps: (1) Confusing import set tables with CMDB tables — import sets are staging, not production. (2) Misunderstanding script execution order — onBefore runs first, then field maps, then onAfter. (3) Forgetting that import set data is automatically purged after successful transformation. (4) Not recognizing when to use coalesce vs matching conditions.

Service Graph Connectors (SGC)

Service Graph Connectors are pre-built, certified integration packages for importing third-party data into the CMDB following CSDM standards. They are the recommended approach for integrating external data sources whenever a connector is available.

Value Proposition

  • Reduced implementation time — pre-built mappings and logic
  • Improved data quality — certified identification rules
  • CSDM compliance out-of-the-box
  • ServiceNow-managed updates — connectors are maintained and upgraded by ServiceNow

Architecture

SGCs are built on IntegrationHub ETL as the processing engine. They use the Robust Transform Engine (RTE), NOT classic transform maps.

SGC Components

  • Data Source Configuration — connection details and credentials
  • Staging Tables — temporary storage for incoming data
  • Transformation Logic — mapping and conversion rules
  • Identification Rules — certified rules for CI matching
  • Monitoring Dashboards — visibility into connector health and data flow

Setup Process

  1. Installation from the ServiceNow Store
  2. Configuration — set up credentials, connection parameters, and schedule
  3. Mapping Review — verify and customize field mappings as needed
  4. Test Execution — run the connector in a test environment
  5. Production Deployment — enable the scheduled connector in production

Editing SGC Mappings

To edit SGC mappings: Access IntegrationHub ETL Studio, navigate to Step 3: Select CMDB Classes. You MUST mark Steps 1 and 2 as complete first before Step 3 mappings become editable. Then click Edit Mapping to customize.

📝Steps 1 and 2 Must Be Complete

This is a critical exam point: Steps 1 and 2 must be marked as complete before Step 3 mappings become editable in IntegrationHub ETL Studio. This is a commonly tested scenario.

Common Connector Categories

CategoryExamplesCI Types
Cloud PlatformsAWS, Azure, GCPVirtual instances, storage, cloud services
Endpoint ManagementSCCM, Intune, TaniumComputers, devices, installed software
ObservabilityDatadog, DynatraceApplications, dependencies, services
NetworkInfoblox, ClarotyIP addresses, network devices
MDMJamf, Workspace ONEMobile devices

Key Facts

  • SGCs use Robust Transform Engine (RTE), NOT classic transform maps
  • Connectors run in isolated application scopes for upgrade safety
  • SGCs are ALWAYS preferred over custom Import Sets when a connector is available for the data source

Discovery

Discovery is ServiceNow's automated tool for finding and mapping IT infrastructure. It probes networks, identifies devices and applications, and populates the CMDB with CI data. All Discovery data flows into the CMDB through the IRE.

Discovery Types

  • Horizontal Discovery — broad network scanning to find devices across IP ranges; casts a wide net to identify everything on the network
  • Top-Down Discovery — starts from known services and maps dependencies downward; focuses on service context and relationships

Discovery Patterns

  • Behavior Pattern — defines how to probe and classify a device or application
  • Quick Pattern — simplified pattern for rapid deployment
  • Application Pattern — specific to application discovery and dependency mapping
📝Exam Focus Areas

The exam emphasizes how Discovery feeds data into the CMDB through the IRE, when to use Horizontal vs Top-Down Discovery, and the impact of Discovery on CMDB Health scores.

AD SPACE

Normalization

Normalization standardizes CI attribute values from various raw sources into a consistent, canonical format. Without normalization, the same manufacturer or OS could appear as dozens of different strings, making reporting and management unreliable.

Normalization Data Services (NDS)

ServiceNow uses Normalization Data Services (NDS) to perform normalization. NDS is applied during the identification and reconciliation process, ensuring data is standardized before it is committed to the CMDB.

What Gets Normalized

  • Manufacturer names (e.g., IBM vs International Business Machines → IBM)
  • Model names (e.g., various model string formats → canonical model name)
  • Operating system values (e.g., win10, Windows 10 Pro, W10 → Microsoft Windows 10)

Normalization Process Flow

Source data → Transform Map → Field Maps → Transform Scripts (normalization logic) → Standardized CMDB values. Normalization can occur at multiple points in the ingestion pipeline, but NDS is the primary mechanism.

💡Normalization Example

The values "win10", "Windows 10 Pro", and "W10" all normalize to "Microsoft Windows 10". This ensures consistent reporting and accurate CI counts regardless of how different sources label the same OS.

Upgrade-Safe Ingestion Patterns

Maintaining upgrade safety is critical when customizing CMDB ingestion processes. ServiceNow regularly releases platform upgrades, and any modifications to out-of-the-box (OOB) components may be overwritten or cause conflicts during upgrades.

🔴Golden Rule

Do NOT modify out-of-the-box (OOB) components. Always create custom versions instead.

Upgrade-Safe Techniques

TechniqueDetails
Custom Tables & FieldsExtend cmdb_ci with custom tables; keeps custom data separate from OOB schema
Custom Data SourcesCreate new data sources instead of editing OOB definitions
Custom Transform MapsWrite custom scripts without altering OOB transform map logic
Custom Identification RulesAvoid editing OOB rules; use the order field to control evaluation sequence
Use of u_ PrefixClearly demarcates customizations; fields with u_ prefix are auto-preserved during upgrades

Standard Upgrade-Safe Workflow

  1. Create Custom Data Source
  2. Create Custom Transform Map
  3. Map Fields (source to target)
  4. Write Transform Scripts for custom logic
  5. Create Custom Identification Rules

Anti-Patterns (NOT Upgrade-Safe)

  • Directly editing an OOB Discovery Pattern
  • Modifying OOB Transform Map scripts
  • Changing OOB CMDB field definitions
  • Deleting OOB Identification Rules
💡Clone, Don't Edit

Need to change an OOB Discovery Pattern? Clone it and modify the clone. This preserves the original for upgrades while giving you full control over the customized version.

Ingestion Scenarios

Practice Scenarios

SCENARIO

Discovery reports Server-X has 8GB RAM. SCCM also reports on Server-X but says RAM is 16GB. Discovery has a reconciliation priority of 100 and SCCM has a priority of 200. What value is stored?

Answer

8GB (Discovery's value). In reconciliation, the LOWER priority number indicates HIGHER authority. Discovery (100) outranks SCCM (200). The CI record shows 8GB. However, CMDB 360 stores BOTH values in the cmdb_multisource_data table, so administrators can compare and override if needed.

SCENARIO

An Import Set payload matches two existing CIs based on identification rules. What happens?

Answer

The IRE HALTS processing. It does NOT update either CI. It creates a De-duplication Task and flags both records for manual resolution. This safety mechanism prevents incorrect data merges when the identification is ambiguous.

SCENARIO

The SCCM integration stopped updating 3 weeks ago, but Discovery still runs. A Data Refresh Rule is configured for 14 days. What happens to SCCM-updated attributes?

Answer

If SCCM had higher priority for certain attributes and those attributes have a Data Refresh Rule set to 14 days, then after 14 days of no SCCM updates, Discovery (the next-priority source) would be allowed to update those specific attributes.

SCENARIO

Which ingestion method should be recommended for importing AWS infrastructure data into the CMDB?

Answer

Service Graph Connector for AWS. Service Graph Connectors are always the preferred method because they come with pre-built IRE mappings, CSDM-aligned class structures, and are maintained by ServiceNow. Only use custom Import Sets or ETL when no SGC exists for the data source.

SCENARIO

A dependent CI (Network Interface Card) fails to process with a MISSING_DEPENDENCY error. What is the cause?

Answer

The parent CI (the Server) could not be identified by the IRE. Dependent CIs require their parent to be resolved first. The parent CI must be successfully identified before the dependent CI can be processed. Ensure the parent CI is ingested before its dependents.

Key Takeaways

  • The IRE is the core engine — it identifies, reconciles, and processes all incoming CI data
  • IRE stops at the FIRST identification rule match and does NOT continue evaluating lower-priority rules
  • When IRE finds multiple matches, it creates a De-duplication Task — NO update occurs
  • Lower reconciliation priority number = higher authority (Discovery at 100 beats SCCM at 200)
  • Reconciliation is per-attribute, not per-CI — different fields can come from different sources
  • Service Graph Connectors are ALWAYS preferred over custom Import Sets when available
  • SGCs use IntegrationHub ETL and Robust Transform Engine (RTE), not classic transform maps
  • Steps 1 and 2 must be marked complete before editing SGC mappings in Step 3
  • Import Set data is automatically purged after successful transformation
  • Transform script order: onBefore → field maps → onAfter
  • CMDB 360 stores all source values in cmdb_multisource_data for comparison
  • sys_object_source maps external native keys to ServiceNow sys_ids
  • Never modify OOB components — always clone/extend for upgrade safety
  • Data Refresh Rules allow lower-priority sources to update when higher-priority sources go stale
AD SPACE

Community-created study aids. Not official ServiceNow exam content.