Fundamentals of Data Engineering Index
Fundamentals of Data Engineering Index
Prerequisites
How to Contact Us
Acknowledgments
Business Responsibilities
Technical Responsibilities
The Continuum of Data Engineering Roles, from A to B
Conclusion
Additional Resources
Storage
Ingestion
Transformation
Serving Data
Security
Data Management
DataOps
Data Architecture
Orchestration
Software Engineering
Conclusion
Additional Resources
Event-Driven Architecture
Data Warehouse
Data Lake
Lambda Architecture
Kappa Architecture
Data Mesh
Conclusion
Additional Resources
Speed to Market
Interoperability
Cost Optimization and Business Value
FinOps
Our Advice
Location
On Premises
Cloud
Hybrid Cloud
Multicloud
Our Advice
Our Advice
Monolith
Modularity
The Distributed Monolith Pattern
Our Advice
Serverless
Containers
Our Advice
Asymmetric Optimization
Caveat Emptor
Data Management
DataOps
Data Architecture
Software Engineering
Conclusion
Additional Resources
APIs
Logs
Database Logs
CRUD
Insert-Only
Types of Time
Databases
APIs
Data Sharing
Security
Data Management
DataOps
Data Architecture
Orchestration
Software Engineering
Conclusion
Additional Resources
6. Storage
Solid-State Drive
Serialization
Compression
Caching
File Storage
Block Storage
Object Storage
Cache and Memory-Based Storage Systems
Streaming Storage
Data Platforms
Data Catalog
Data Sharing
Schema
Undercurrents
Security
Data Management
DataOps
Data Architecture
Orchestration
Software Engineering
Conclusion
Additional Resources
7. Ingestion
Frequency
Payload
Data Migration
Message and Stream Ingestion Considerations
Schema Evolution
Late-Arriving Data
Replay
Time to Live
Message Size
Location
APIs
EDI
Shell
SSH
SFTP and SCP
Webhooks
Web Interface
Web Scraping
Data Sharing
Upstream Stakeholders
Downstream Stakeholders
Undercurrents
Security
Data Management
DataOps
Orchestration
Software Engineering
Conclusion
Additional Resources
Queries
What Is a Query?
Data Modeling
Normalization
Transformations
Batch Transformations
Upstream Stakeholders
Downstream Stakeholders
Undercurrents
Security
Data Management
DataOps
Data Architecture
Orchestration
Software Engineering
Conclusion
Additional Resources
Trust
Data Products
Self-Service or Not?
Data Mesh
Analytics
Business Analytics
Operational Analytics
Embedded Analytics
Machine Learning
File Exchange
Databases
Streaming Systems
Query Federation
Data Sharing
Semantic and Metrics Layers
Reverse ETL
Undercurrents
Security
Data Management
DataOps
Data Architecture
Orchestration
Software Engineering
Conclusion
Additional Resources
People
Always Be Paranoid
Processes
Active Security
Technology
Encryption
Network Access
Conclusion
Additional Resources
Moving Beyond the Modern Data Stack, Toward the Live Data
Stack
Conclusion
Serialization Formats
Row-Based Serialization
Columnar Serialization
Hybrid Serialization
B. Cloud Networking
Availability Zones
Regions
CDNs
Index