A Simple PPT With Data Concepts 1703019257
A Simple PPT With Data Concepts 1703019257
https://youtu.be/M39fO13CALY
Reinforcement learning
• Technique that draws on machine learning / Deep learning, and
optimization.
• The objective is for an "actor" to execute the most optimal "action",
given a "scenario".
• It is used in robotics, to make robots perform
• Also used in autonomous cars
• Famous Algorithms: Q-Learning
Types of Deep Learning networks
• Neural networks are classified according to their connection scheme
between nodes. From this, topologies and "Architectures" are
created.
• DNN: Directed/Dense Neural Network, the most basic, all nodes in
one layer are connected to all nodes in the next layer.
• CNN: Convolutional, nodes are connected by mimicking the local
view, through convolutional layers and filters.
• RNN: Recurrent, have nodes that go "back in time" storing memory of
previous data, useful for time series.
Most famous architectures in Deep Learning
• Transformers: Network that keeps an attention weight between each
data in a time series. They are used for "Sequence by sequence" tasks
in text generation and translation, music generation, etc.
• U-net: Convolutional network with a decoder and an encoder, for
image segmentation or image generation (Diffusers networks usually
use u-net).
• Autoencoder: Identical input and output, used to compress
information and discard noisy data / outliers.
Where to store data
Data models
• Conceptual
• Logics
• Physical
Data models
• Conceptual
• also known as domain models, provide a general picture of what the system
will contain, how it will be organized and what business rules apply.
Conceptual models are typically created as part of the initial requirements
gathering process for the project. They typically include entity classes (which
define the types of things that are important to represent the business in the
data model), their characteristics and constraints, the relationships between
them, and the relevant data security and integrity requirements.
• Logics
• Physical
Data models
• Conceptual
• Logics
• are less abstract and provide more details about the concepts and
relationships in the domain under consideration. One of the formal data
modeling notation systems is followed. These indicate data attributes, such as
data types and their corresponding lengths, and show the relationships
between entities. Logical data models do not specify any technical system
requirements.
• Physical
Data models
• Conceptual
• Logics
• Physical
• are less abstract and provide more details about the concepts and
relationships in the domain under consideration. One of the formal data
modeling notation systems is followed. These indicate data attributes, such as
data types and their corresponding lengths, and show the relationships
between entities. Logical data models do not specify any technical system
requirements.
Work frames
• TOGAF
• Business architecture, which defines the organizational structure, business strategy
and processes of the company.
• Data architecture, which describes the physical, logical and conceptual data assets
and how they are stored and managed throughout the lifecycle.
• Application architecture, which represents the application systems, and how they
relate to the main business processes and to each other.
• Technical architecture, which describes the technological infrastructure (hardware,
software and networks) required to support the most important applications.
• DAMA
• Zachman
Work frames
• TOGAF
• Iterate through the different phases
• DAMA
• Zachman
Work frames
• TOGAF
• DAMA
• DAMA International, originally founded as Data Management Association
International, is a non-profit organization dedicated to the advancement of
data and information management. Its data management body of knowledge,
DAMA-DMBOK 2, covers data architecture, as well as data governance and
ethics, data modeling and design, storage, security, and integration.
• Zachman
Work frames
• TOGAF
• DAMA
• Prioritize dimensions
• Evaluate each dimension by
Each data process
• Zachman
Data management systems
• Data warehouse: Structured. Eg: Relational, Snowflake or Star model
• Data lake: Unstructured + Catalog
• Data lakehouse: Unstructured + Structured + Catalog + Governance
Data architectures
• Data fabric
• Centralized architecture
• The data are on a "path".
From the origins to
consumers of analytics
• Data mesh
Data architectures
• Data fabric
• Data mesh
Advantages of data management
• Reduction of redundancy
• Improved data quality
• Enabling integration
• Data lifecycle management
• Others