Lecture 5
Lecture 5
From Wikipedia, . . .
From Wikipedia, . . .
From Wikipedia, . . .
LIMS are frequently used in industry, whereas academic labs have been slower
to adapt LIMS due to the large overhead involved in installing and maintaining
such systems. Several minimal LIMS or ELN systems for small research groups
are starting to appear, and with the advance of high throughput technologies
and Systems Biology it is likely that more academic labs and small research
groups will adopt LIMS or ELNs.
ABcontrols.com
Nucleic Acid Research
1. Logical
2. Physical
3. External
Data Abstraction
1. Logical
2. Physical = data files. Access is one of the few to store as a
single file. This is quite limiting for large databases or those
accessed by many people.
3. External
Data Abstraction
1. Logical
2. Physical
3. External = user views of the data. Multiple users can be
looking at the same database (physical files) and yet be
presented with completely different views.
Why layers?
I Flat files
I Hierarchical Models
I Network Models
I Relational Models
I Object-Orientated Model (OO)
I Object-Relational
Database models
I Flat files
I Hierarchical Models - Each file contains records with pointers
(generally these were physical pointers) to other records in a
one-to-many relationship (parent to children).
I Network Models
I Relational Models
I Object-Orientated Model (OO)
I Object-Relational
Database models
I Flat files
I Hierarchical Models
I Network Models - there is still a one-to-many relationship but
the child could have multiple parents. Similar to the web in
that each page may have many other pages that link to it.
Problem with this is that relationships become very complex
and can even have circular links.
I Relational Models
I Object-Orientated Model (OO)
I Object-Relational
Database models
I Flat files
I Hierarchical Models
I Network Models
I Relational Models - Should not need to follow predefined links
as in the web. The key to a relational model is to have a
common data item stored within each record. These are called
keys or ID’s. In this case, physically the rows/entries of each
record of data need not even be in the same file.
I Object-Orientated Model (OO)
I Object-Relational
Database models
I Flat files
I Hierarchical Models
I Network Models
I Relational Models
I Object-Orientated Model (OO) - brought about by the need
to handle images, audio, etc. ≡ complex data. An object is a
logical grouping of related data and associated program logic.
I Object-Relational
Database models
I Flat files
I Hierarchical Models
I Network Models
I Relational Models
I Object-Orientated Model (OO)
I Object-Relational - The OO methods lacks a simplistic ad-hoc
query capability. So people have created hybrids of Relational
and Object methods (e.g. Oracle, DB2).
Organizing Table structure
I Problems?
I What if someone is old (aka experienced) and
recognizes that Pongidae is the legacy family
name?
I All of this data is very repetitive?
I What happens if I misspell an entry?
I What happens if the taxonomists decide to
change the names? (e.g. Pan paniscus used to be
a subspecies of Pan troglodytes).
Table structure
Title
item ID
item
item
item
item
item
Table structure
An example of a (very
poor) table is shown here.
Note that there is lots of DNA SEQUENCE
information potentially SEQUENCE ID
associated with a sequence GENBANK ACC NUMBER
record beyond the element SAMPLE
itself. SEQUENCE TYPE (eg. mtDNA)
SEQUENCE MACHINE
Note that the table title is SEQUENCE SOFTWARE
“DNA SEQUENCE” TECHNICIAN
rather than “DNA TECHNICIAN LOCATION
SEQUENCES”; table titles TECHNICIAN EMAIL
SEQUENCE TEXT
are always singular. This is
because it represents one
object.