SAP Datasphere Data Builder
SAP DSP – ‘Data Builder’ Overview
• Modeling Data
Once data is available in SAP Datasphere, modelers can enrich it by adding
semantic layers, refining entities, and building focused analytical models.
These models are ready for use in SAP Analytics Cloud, Microsoft Excel, and
other integrated tools and applications.
kr_pavankumar
Objects
1. Tables: Local Table, Remote Table
2. Views: Graphical View, SQL View
3. ER Models
5. Flows: Data Flow, Replication Flow, Transformation Flow
6. Intelligent Lookups
7. Task Chain
2. Views: Graphical vs. SQL
a) Graphical View: A no-code way to model data using a drag-and-drop interface.
Supports filters, renaming/excluding columns, calculated columns, aggregation,
and joins.
• Filter: Restrict data rows based on conditions (e.g., sales > 1000).
• Rename/Exclude Columns: Tweak column names or drop unnecessary ones for clarity.
• Calculated Columns: Create new columns with formulas (e.g., profit = revenue - cost).
• Aggregation: Summarize data (e.g., SUM, AVG) for reporting.
• Join Suggestion: Auto-suggests joins between tables based on defined Associations – Saves time!
b) SQL View: Write SQL code for complex logic. Ideal for business requirements
that need precise control (such as subqueries, CTEs, procedures, or advanced
joins) beyond what graphical views offer.
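To make the SQL View idea concrete, here is a minimal sketch that uses Python's built-in `sqlite3` module as a stand-in database. It is not SAP Datasphere's SQL dialect or API; the table `sales`, the view `v_profit_by_region`, and the sample rows are all assumptions made for illustration. The view combines a filter, a calculated column, an aggregation, and a CTE.

```python
import sqlite3

# In-memory SQLite database used only as a stand-in for a modeling space.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE sales (order_id INTEGER, region TEXT, revenue REAL, cost REAL);
    INSERT INTO sales VALUES
        (1, 'EMEA', 1500.0,  900.0),
        (2, 'EMEA',  800.0,  500.0),
        (3, 'APAC', 2500.0, 1600.0),
        (4, 'APAC', 1200.0,  700.0);

    -- Hypothetical view: the CTE filters rows and adds a calculated column,
    -- the outer query aggregates per region.
    CREATE VIEW v_profit_by_region AS
    WITH large_orders AS (
        SELECT region, revenue - cost AS profit
        FROM sales
        WHERE revenue > 1000
    )
    SELECT region, SUM(profit) AS total_profit, COUNT(*) AS order_count
    FROM large_orders
    GROUP BY region;
""")

rows = conn.execute(
    "SELECT region, total_profit, order_count "
    "FROM v_profit_by_region ORDER BY region"
).fetchall()
print(rows)
```

A graphical view would express the same filter, calculated column, and aggregation as drag-and-drop nodes; the SQL form is mainly useful when the logic outgrows those nodes.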
Fact: A semantic usage setting indicating that the entity contains numerical
measures that can be analyzed.
4. ER Models: Mapping Relationships
• Create Table: Build a new table within the entity-relationship diagram.
• Create View from Selection: Generate a view from selected tables/views in the
model.
• Create Association: Link entities (e.g., orders to customers) for relational
modeling.
5. Analytic Model: Powering Analytics
a) Measures
• Calculated Measure: Define custom metrics (e.g., profit margin = profit/revenue).
• Restricted Measure: Limit a measure by conditions (e.g., sales for 2024 only).
• Count Distinct Measure: Count unique values (e.g., distinct customers).
• Currency Conversion Measure: Convert values across currencies dynamically.
• Non-Cumulative Measure: Handle non-additive metrics (e.g., stock levels).
b) Variables
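As a rough illustration of three of the measure types listed under a), the sketch below computes a calculated measure, a restricted measure, and a count distinct measure in plain Python. The `Sale` record, the sample data, and the function names are hypothetical; in an analytic model these are defined declaratively rather than coded. The `year` argument plays a role loosely similar to a variable supplied at query time.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Sale:
    customer: str
    year: int
    revenue: float
    cost: float


# Hypothetical sample data.
SALES = [
    Sale("Acme", 2023, 1000.0, 700.0),
    Sale("Acme", 2024, 2000.0, 1200.0),
    Sale("Globex", 2024, 1000.0, 800.0),
]


def profit_margin(rows):
    """Calculated measure: total profit divided by total revenue."""
    revenue = sum(r.revenue for r in rows)
    profit = sum(r.revenue - r.cost for r in rows)
    return profit / revenue if revenue else 0.0


def revenue_for_year(rows, year):
    """Restricted measure: revenue limited to rows of a single year."""
    return sum(r.revenue for r in rows if r.year == year)


def distinct_customers(rows):
    """Count distinct measure: number of unique customers."""
    return len({r.customer for r in rows})


margin = profit_margin(SALES)
revenue_2024 = revenue_for_year(SALES, 2024)
customers = distinct_customers(SALES)
print(margin, revenue_2024, customers)
```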
6. Flows: Moving and Transforming Data
1. Data Flow
✓ Key Features:
• Data can be extracted from multiple sources, transformed, and then loaded into
target tables.
• More flexible and customizable. Useful for building end-to-end data pipelines.
2. Replication Flow
✓ Use Case: When you want to mirror or synchronize source tables as-is.
✓ Key Features:
• Supports real-time replication for supported sources (like SAP S/4HANA, SAP
BW).
• Minimal or no transformation.
• Ensures high performance and fast updates.
• Load Type: Initial Only (full load) or Initial and Delta (full load followed
by incremental updates).
3. Transformation Flow
✓ Use Case: When raw data needs to be cleaned, enriched, joined, filtered, or
aggregated before consumption.
✓ Key Features:
• Drag-and-drop interface to define transformation logic.
• Use of joins, filters, calculations, and aggregations.
• Output is often used for analytical models.
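The difference between the two load types mentioned for flows (a full initial load versus an initial load followed by deltas) can be sketched in plain Python. This is a conceptual illustration only, not how SAP Datasphere implements replication; the change-record shape (`op`, `id`, `row`) and the function names are assumptions.

```python
def initial_load(source_rows):
    """Initial load: copy every source row into a fresh target, keyed by id."""
    return {row["id"]: dict(row) for row in source_rows}


def apply_delta(target, changes):
    """Delta load: apply only the changes captured since the previous run."""
    for change in changes:
        if change["op"] == "delete":
            target.pop(change["id"], None)
        else:  # "insert" or "update" both write the latest row image
            target[change["id"]] = dict(change["row"])
    return target


# Hypothetical source table and change log.
source = [{"id": 1, "status": "open"}, {"id": 2, "status": "open"}]
target = initial_load(source)

changes = [
    {"op": "update", "id": 1, "row": {"id": 1, "status": "closed"}},
    {"op": "insert", "id": 3, "row": {"id": 3, "status": "open"}},
    {"op": "delete", "id": 2},
]
target = apply_delta(target, changes)
print(target)
```

With Initial Only, the job would stop after `initial_load`; with Initial and Delta, later runs move only the changed rows, which is why it suits large tables that change a little at a time.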
7. Intelligent Lookups: Smart Data Matching
Purpose: To match and enrich data from two different datasets—even if they
don’t have perfectly matching key fields.
Use Case: When you have master data and transactional data that share similar
(but not identical) attributes, and you want to intelligently map and combine
them.
Key Features:
• Uses exact and fuzzy match rules to suggest the best matches between records.
• Minimizes the need for exact joins or manual mapping.
• Great for data enrichment or combining messy, siloed datasets.
1. Input Table Node: Your starting dataset.
2. Look up: Reference another table for enrichment.
3. Rule(s): Define match logic; outputs include Matched Records, Unmatched,
Unprocessed, or Error.
4. Output View: Resulting enriched view.
Purpose: Automates data lookups (e.g., adding customer names to orders).
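The four steps above can be approximated with a small Python sketch that matches on similar, not identical, names. It uses `difflib.SequenceMatcher` from the standard library as a stand-in similarity score; the actual matching logic of Intelligent Lookup is not shown in this deck, so the threshold of 0.8, the field names, and the sample data are all assumptions.

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Case-insensitive similarity ratio between two strings (0.0 to 1.0)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def lookup(orders, customers, threshold=0.8):
    """Enrich each order with its best-matching customer, if similar enough."""
    matched, unmatched = [], []
    for order in orders:
        best = max(customers, key=lambda c: similarity(order["customer"], c["name"]))
        if similarity(order["customer"], best["name"]) >= threshold:
            matched.append(
                {**order, "customer_id": best["id"], "customer_name": best["name"]}
            )
        else:
            unmatched.append(order)
    return matched, unmatched


# Hypothetical lookup table (master data) and input table (transactions).
customers = [
    {"id": "C1", "name": "Acme Corporation"},
    {"id": "C2", "name": "Globex Inc"},
]
orders = [
    {"order_id": 1, "customer": "ACME Corporation"},
    {"order_id": 2, "customer": "Globex Inc."},
    {"order_id": 3, "customer": "Initech"},
]

matched, unmatched = lookup(orders, customers)
print(matched)
print(unmatched)
```

Unmatched records are kept separate rather than dropped, mirroring the idea that they can be reviewed or handled by a further rule.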
8. Task Chain
➢Purpose: To automate and orchestrate a sequence of data-related tasks or
flows (like data load, transform, publish).
➢Use Case: When you want to schedule or automate a multi-step data pipeline
involving Data Flows, Transformation Flows, or Replication Flows.
➢Key Features:
• Build workflows of tasks.
• Configure dependencies and execution order.
• Supports scheduling and monitoring.
• Useful for end-to-end automation.
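The idea of configuring dependencies and execution order can be sketched with Python's standard `graphlib` module (Python 3.9+). This is only a conceptual model of a task chain, not the SAP Datasphere scheduler; the task names and the `run_chain` helper are hypothetical.

```python
from graphlib import TopologicalSorter

# Hypothetical tasks; each maps to the set of tasks it depends on.
dependencies = {
    "replicate_orders": set(),
    "replicate_customers": set(),
    "transform_sales": {"replicate_orders", "replicate_customers"},
    "persist_sales_view": {"transform_sales"},
}


def run_chain(dependencies, run_task):
    """Run each task only after all of its predecessors have completed."""
    completed = []
    for task in TopologicalSorter(dependencies).static_order():
        run_task(task)
        completed.append(task)
    return completed


log = []
order = run_chain(dependencies, log.append)
print(order)
```

The two replication tasks have no mutual dependency, so their relative order is not fixed; only the constraint that both finish before `transform_sales` is guaranteed.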
9. Data Access Control: Securing Data
➢Use Case: When different users or roles should see only specific parts of a
dataset (e.g., region-based access for sales managers).
➢Key Features:
• Define DAC entities (e.g., region, country).
• Assign users or roles to specific DAC values.
• Integrated with SAP Analytics Cloud and other front-end tools for secure data
consumption.
• Ensures data privacy and governance compliance.
Permissions Entity: Links to a table with user/role permissions.
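Row-level filtering driven by a permissions table can be sketched as follows. This is a simplified illustration of the concept, not the SAP Datasphere implementation; the table layout (one row per user and allowed region), the user names, and the `visible_rows` helper are assumptions.

```python
# Hypothetical permissions entity: one row per (user, allowed region).
PERMISSIONS = [
    {"user": "alice", "region": "EMEA"},
    {"user": "bob", "region": "APAC"},
    {"user": "bob", "region": "EMEA"},
]

# Hypothetical protected dataset.
SALES = [
    {"order_id": 1, "region": "EMEA", "amount": 100},
    {"order_id": 2, "region": "APAC", "amount": 250},
    {"order_id": 3, "region": "AMER", "amount": 75},
]


def visible_rows(user, rows, permissions):
    """Return only the rows whose region the user is permitted to see."""
    allowed = {p["region"] for p in permissions if p["user"] == user}
    return [r for r in rows if r["region"] in allowed]


alice_rows = visible_rows("alice", SALES, PERMISSIONS)
bob_rows = visible_rows("bob", SALES, PERMISSIONS)
carol_rows = visible_rows("carol", SALES, PERMISSIONS)  # no permissions at all
print(alice_rows, bob_rows, carol_rows)
```

A user with no entry in the permissions table sees no rows in this sketch, which is the conservative default for this kind of filter.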