8-9 Spatial Data Maintenance
Lecture 06-07
Spatial Data Management
Overview of Spatial Data Management
Spatial database management deals with the storage, indexing, and
querying of data with spatial features, such as location and geometric
extent.
Many applications require the efficient management of spatial data,
including Geographic Information Systems, Computer Aided Design, and
Location Based Services.
Spatial indices are used by spatial databases (databases which store
information related to objects in space) to optimize spatial queries.
Conventional index types do not handle spatial queries efficiently, such as
how far apart two points are, or whether a point falls within a spatial area
of interest.
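A minimal sketch of the kind of query a spatial index accelerates: a uniform grid index that answers a rectangular range query by visiting only the cells that overlap the query window, instead of scanning every point. The class, cell size, and data here are illustrative, not part of any real spatial database.

```python
# Minimal uniform-grid spatial index (illustrative only).
# Points are bucketed by grid cell; a range query visits only the cells
# that overlap the query rectangle instead of scanning every point.

class GridIndex:
    def __init__(self, cell_size):
        self.cell_size = cell_size
        self.cells = {}          # (cx, cy) -> list of (x, y) points

    def _cell(self, x, y):
        return (int(x // self.cell_size), int(y // self.cell_size))

    def insert(self, x, y):
        self.cells.setdefault(self._cell(x, y), []).append((x, y))

    def range_query(self, xmin, ymin, xmax, ymax):
        cx0, cy0 = self._cell(xmin, ymin)
        cx1, cy1 = self._cell(xmax, ymax)
        hits = []
        for cx in range(cx0, cx1 + 1):
            for cy in range(cy0, cy1 + 1):
                for (x, y) in self.cells.get((cx, cy), []):
                    if xmin <= x <= xmax and ymin <= y <= ymax:
                        hits.append((x, y))
        return hits

idx = GridIndex(cell_size=10)
for p in [(1, 1), (5, 7), (12, 3), (25, 25)]:
    idx.insert(*p)
print(sorted(idx.range_query(0, 0, 10, 10)))   # [(1, 1), (5, 7)]
```

Real spatial databases use more sophisticated structures such as R-trees, but the principle is the same: prune the search space using geometry before testing individual objects.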
Data management means more than simply handling updates. Here are
five scenarios that we often see.
Data migration
Data transformation
Data enhancement
Data integration
Data conflation
Data Migration
The process of selecting, preparing, extracting, and transforming data
and permanently transferring it from one computer storage system to
another.
It is a key consideration for any system implementation, upgrade, or
consolidation, and it is typically performed in such a way as to be as
automated as possible, freeing up human resources from tedious tasks.
It occurs for a variety of reasons, including server or storage equipment
replacements, maintenance or upgrades, application migration, website
consolidation, disaster recovery, and data center relocation.
Categories
Storage migration: Moving physical blocks of data from one disk to another,
often using virtualization techniques. The data format and content are not
usually changed in the process.
Database migration: Similarly, it may be necessary to move from one database vendor
to another, or to upgrade the version of database software being used.
Application migration: Changing application vendors, such as moving to a new CRM or ERP platform.
Business process migration: Business processes operate through a combination of
human and application systems actions. When these change they can require the
movement of data from one store, database or application to another to reflect the
changes to the organization.
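The database migration category above can be sketched on a toy example: copy records from a legacy store into a new schema, renaming fields and validating each record on the way. The stores, field names, and validation rule are hypothetical stand-ins for real systems.

```python
# Illustrative migration step: legacy records are extracted, field names are
# mapped to the new schema, and each record is validated before loading.
# All names here are hypothetical.

legacy_rows = [
    {"CUST_ID": "001", "CUST_NM": "Acme", "REGION_CD": "EU"},
    {"CUST_ID": "002", "CUST_NM": "Globex", "REGION_CD": "NA"},
]

FIELD_MAP = {"CUST_ID": "customer_id", "CUST_NM": "name", "REGION_CD": "region"}

def migrate(rows):
    migrated, rejected = [], []
    for row in rows:
        new_row = {FIELD_MAP[k]: v for k, v in row.items()}
        if new_row["customer_id"] and new_row["name"]:   # basic validation
            migrated.append(new_row)
        else:
            rejected.append(row)                         # kept for manual review
    return migrated, rejected

migrated, rejected = migrate(legacy_rows)
```

Automating the mapping and validation like this is what lets migrations run with minimal manual effort, as the slide notes.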
Disadvantages of Data Migration
Migration addresses the possible obsolescence of the data carrier but
does not address the fact that certain technologies which run the data
may be abandoned altogether, leaving migration useless.
Time-consuming – migration is a continual process, which must be
repeated every time a medium reaches obsolescence, for all data objects
stored on that medium.
Costly – an institution must purchase additional data storage media at
each migration.
If your old data management system has poor data quality and you migrate
that data into your new system as-is, the new system will most likely
inherit the same challenges, headaches, and poor data quality.
Data Transformation
Data transformation is the process of converting data from one format,
such as a database file, XML document or Excel spreadsheet, into
another.
Transformations often involve converting a raw data source into a
cleansed, validated and ready-to-use format.
Data transformation can be simple or complex, based on the required
changes to the data between the source (initial) data and the target
(final) data.
Data transformation can be divided into the following steps, each
applicable as needed based on the complexity of the transformation
required.
Data discovery: Typically the data is profiled using profiling tools or sometimes using
manually written profiling scripts to better understand the structure and characteristics
of the data and decide how it needs to be transformed.
Data mapping: The process of defining how individual fields are mapped, modified,
joined, filtered, aggregated, etc. to produce the final desired output.
Code generation: The process of generating executable code (e.g. SQL, Python, R, or
other executable instructions) that will transform the data based on the desired and
defined data mapping rules.
Data Transformation…
Code execution: Step whereby the generated code is executed against the data to
create the desired output. The executed code may be tightly integrated into the
transformation tool, or it may require separate steps by the developer to manually
execute the generated code
Data review: The final step in the process, which focuses on ensuring the output data
meets the transformation requirements. Any anomalies or errors found in the data
are communicated back to the developer or data analyst as new requirements
to be implemented in the transformation process.
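The steps above can be sketched as a small pipeline: profile the source (discovery), apply a mapping (mapping, code generation, and execution collapsed into one function for brevity), and flag anomalies for review. The dataset, field names, and conversion are illustrative assumptions, not part of any real tool.

```python
# Toy transformation pipeline mirroring the steps in the slides.
# Source field and values are made up for illustration.

source = [{"temp_f": "68"}, {"temp_f": "86"}, {"temp_f": "bad"}]

# Data discovery: profile the field to see what kinds of values occur.
def profile(rows, field):
    values = [r.get(field) for r in rows]
    numeric = sum(1 for v in values if str(v).lstrip("-").isdigit())
    return {"count": len(values), "numeric": numeric}

# Data mapping + code execution: convert Fahrenheit strings to Celsius floats.
def transform(rows):
    out, errors = [], []
    for r in rows:
        try:
            out.append({"temp_c": round((float(r["temp_f"]) - 32) * 5 / 9, 1)})
        except ValueError:
            errors.append(r)      # data review: anomalies go back to the analyst
    return out, errors

stats = profile(source, "temp_f")
clean, errors = transform(source)
```

The profiling step would have warned up front that one of the three values is non-numeric, which is exactly the kind of finding that feeds the data review loop.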
Types of Data Transformation
Batch Data Transformation
Developers write code or implement transformation rules in a data
integration tool, then execute that code or those rules on large volumes of data.
Batch data transformation is the cornerstone of virtually all data integration
technologies such as data warehousing, data migration and application integration.
Interactive Data Transformation
This is an emerging capability that allows business analysts and business users the
ability to directly interact with large datasets through a visual interface, understand the
characteristics of the data (via automated data profiling or visualization), and change or
correct the data through simple interactions such as clicking or selecting certain
elements of the data.
Data Conflation
Geospatial data conflation is the compilation or reconciliation of two
different geospatial datasets covering overlapping regions (Saalfeld
1988).
In general, the goal of conflation is to combine the best quality elements
of both datasets to create a composite dataset that is better than either
of them.
The consolidated dataset can then provide additional information that
cannot be gathered from any single dataset.
Based on the types of geospatial datasets dealt with, the conflation
technologies can be categorized into the following three groups.
Vector to vector data conflation: A typical example is the conflation of two road
networks of different accuracy levels.
Raster to raster data conflation
Raster to vector data conflation
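A toy sketch of vector-to-vector conflation, assuming two point datasets where dataset A has the more accurate geometry and dataset B the richer attributes. The datasets, the distance tolerance, and the merge rule are all illustrative assumptions.

```python
# Toy vector-to-vector conflation: match features across two datasets by
# proximity, then build a composite record that keeps the better geometry
# from A and the richer attributes from B. All values are illustrative.

import math

ds_a = [{"id": "a1", "x": 10.02, "y": 5.01}]                      # accurate geometry
ds_b = [{"id": "b7", "x": 10.3, "y": 5.2, "name": "Main St / 1st Ave"}]

def conflate(a_feats, b_feats, tolerance=0.5):
    merged = []
    for a in a_feats:
        for b in b_feats:
            # Match features whose locations agree within the tolerance.
            if math.hypot(a["x"] - b["x"], a["y"] - b["y"]) <= tolerance:
                merged.append({"x": a["x"], "y": a["y"], "name": b["name"]})
    return merged

result = conflate(ds_a, ds_b)
```

Real conflation systems use far more robust matching (topology, names, line geometry), but the composite-of-the-best idea is the same as in the slide's definition.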
Introduction: Geodatabase
At its most basic level, a geodatabase is a collection of geographic
datasets of various types held in a common file system folder, a Microsoft
Access database, or a multiuser relational DBMS (such as Oracle,
Microsoft SQL Server, PostgreSQL, Informix, or IBM DB2).
Geodatabases come in many sizes, have varying numbers of users and
can scale from small, single-user databases built on files up to larger
workgroup, department, and enterprise geodatabases accessed by many
users.
It is the physical store of geographic information, primarily using a
database management system or file system.
Geodatabases have a comprehensive information model for representing
and managing geographic information.
This comprehensive information model is implemented as a series of tables holding
feature classes, raster datasets, and attributes.
In addition, advanced GIS data objects add GIS behavior; rules for managing spatial
integrity; and tools for working with numerous spatial relationships of the core
features, rasters, and attributes
Geodatabases have a transaction model for managing GIS data workflows.
Types: Personal geodatabases
Personal geodatabases—All datasets are stored within a Microsoft Access
data file, which is limited in size to 2 GB.
The original data format for ArcGIS geodatabases, stored and managed in
Microsoft Access data files. (This is limited in size and tied to the Windows
operating system.)
Single user and small workgroups with smaller datasets: some readers and one writer.
Concurrent use eventually degrades for large numbers of readers.
All the contents in each personal geodatabase are held in a single Microsoft Access file
(.mdb).
Two GB per Access database. The effective limit before performance degrades is
typically between 250 and 500 MB per Access database file.
Often used as an attribute table manager (via Microsoft Access). Users like the string
handling for text attributes.
File Geodatabases
A collection of various types of GIS datasets held in a file system folder
Stored as folders in a file system.
Each dataset is held as a file that can scale up to 1 TB in size. The file
geodatabase is recommended over personal geodatabases.
Features:
Provide a widely available, simple, and scalable geodatabase solution for all users.
Provide a portable geodatabase that works across operating systems.
Scale up to handle very large datasets.
Use an efficient data structure that is optimized for performance and storage.
File geodatabases also allow users to compress vector data to a read-only format to
reduce storage requirements even further.
Outperform shapefiles for operations involving attributes and scale the data size limits
way beyond shapefile limits.
Enterprise geodatabases
Enterprise geodatabases—Also known as multiuser geodatabases, they can
be unlimited in size and number of users.
A collection of various types of GIS datasets held as tables in a relational
database
Stored in a relational database using Oracle, Microsoft SQL Server, IBM DB2,
IBM Informix, or PostgreSQL.
Features:
Extremely large, continuous GIS databases
Many simultaneous users
Long transactions and versioned workflows
Relational database support for GIS data management (providing the benefits of a relational
database for scalability, reliability, security, backup, integrity, and so forth)
SQL types for Spatial in all supported DBMSs (Oracle, SQL Server, PostgreSQL, Informix, and
DB2)
High performance that can scale to a very large number of users
The architecture of a geodatabase
The geodatabase is object relational
Based on a series of simple yet essential relational database concepts and
leverages the strengths of the underlying database management system.
Simple tables and well-defined attribute types are used to store the schema,
rule, base, and spatial attribute data for each geographic dataset.
Through this approach, structured query language (SQL)—a series of
relational functions and operators—can be used to create, modify, and query
tables and their data elements.
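The relational approach can be illustrated with plain SQL over a table that stores feature coordinates as ordinary columns. This sketch uses Python's sqlite3 module purely for illustration; a real geodatabase uses dedicated spatial types and spatial indexes rather than bare REAL columns.

```python
# Sketch of the object-relational idea: geometry stored in ordinary table
# columns and queried with plain SQL. Table and data are illustrative.

import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE cities (name TEXT, x REAL, y REAL)")
conn.executemany("INSERT INTO cities VALUES (?, ?, ?)",
                 [("Alpha", 1.0, 1.0), ("Beta", 9.0, 9.0), ("Gamma", 3.0, 4.0)])

# A bounding-box query expressed in ordinary SQL.
rows = conn.execute(
    "SELECT name FROM cities WHERE x BETWEEN 0 AND 5 AND y BETWEEN 0 AND 5 "
    "ORDER BY name"
).fetchall()
print([r[0] for r in rows])   # ['Alpha', 'Gamma']
conn.close()
```

Supported DBMSs expose true SQL spatial types (e.g. geometry columns with spatial operators), so queries like this can use spatial functions and indexes instead of plain coordinate comparisons.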