Final Examination - Class
Final Examination - Class
SUBJECT: BIOINFORMATICS
Dean of School of Lecturer Proctor 1 Score
Biotechnology Signature:
Signature:
Proctor 2
1. Which of the following is incorrect about data collection from DNA microarray analysis?
a. The most common type of microarray protocol is the two-color microarray
b. The two-color microarray uses multiple dyes at times for each sample
c. The cDNAs are obtained by extracting total RNA or mRNA from tissues or cells
and incorporating fluorescent dyes in the DNA strands during the cDNA
biosynthesis
d. The expression of genes is measured via the signals from cDNAs hybridizing with
the specific oligonucleotide probes on the microarray
e. The image of the hybridized array is captured using a laser scanner that scans every
spot on the microarray
2. In the bootstrap method, the data are resampled by _____ choosing _____ columns from
the aligned sequences to produce, in effect, a new sequence alignment of the _____
a. randomly, horizontal, same length
b. randomly, vertical, same length
c. randomly, vertical, different lengths
d. specifically, vertical, same length
e. specifically, vertical, different lengths
1
b. Sequencing – read mapping – normalization – read count – determining of
differentially expressed genes.
c. Sequencing – read mapping – read count – determining of differentially expressed
genes – normalization
d. Sequencing – normalization – read mapping – read count – determining of
differentially expressed genes.
e. Sequencing – read mapping – read count – normalization – determining of
differentially expressed genes.
4. A linear polymer of more than fifty amino acid residues is referred to as a ________
a. dipeptide
b. tripeptide
c. oligopeptide
d. peptide
e. polypeptide
6. A haplotype is
a. a set of SNPs on the same chromosome
b. a set of SNPs on different chromosomes
c. a set of SNPs together with other genetic markers on the same chromosome
d. a set of SNPs together with other genetic markers on different chromosomes
e. None of these
2
9. Phenotypic variation is analyzed for milk production in a herd of dairy cattle and the
following variance components are obtained:
Additive genetic variance (❑2A) = 0.3
Dominance genetic variance (❑2D ❑2D )= 0.2
Environmental variance (❑2E ❑2E )= 0.5
The broad-sense heritability of milk production is
a. 0.3
b. 0.2
c. 0.5
d. 0.7
e. 1
10. Phenotypic variation is analyzed for milk production in a herd of dairy cattle and the
following variance components are obtained:
Additive genetic variance (❑2A) = 0.3
Dominance genetic variance (❑2D ❑2D )= 0.2
Environmental variance (❑2E ❑2E )= 0.5
The narrow-sense heritability of milk production is
a. 0.3
b. 0.2
c. 0.5
d. 0.7
e. 1
3
. folds, superfamilies, and families
20 In GWAS for diseases in human, the definitions of cases and controls are the
. patients and the normal people.
C. Matching terms and figures (10 points) (each term can be used only once)
Terms:
a. Protocols f. Network
b. System g. Components
c. Interactions h. Modules
d. Single Nucleotide Polymorphism (SNP) i. Microsatellite
e. Systems biology j. Phenotypic marker
Figures Terms
Questions 21 & 22 refer to the Figure A System
21. Nodes
22. Edges
Questions 23 & 24 refer to the Figure B
(B)
23. LEGO bricks Components
24. Snap-connections Interactions
25. The sequence in Figure C is an example of i.
Microsatellite
(C)
4
D1. RPKM (Reads Per Kilobase Million) is a normalized estimation of gene expression based on
RNA-seq data. You have sequenced two libraries (liver and muscle). The total number of reads is
2,209,003 for the liver library and 1,503,797 for the muscle library. There are 5,000 reads from the
liver data and 6,000 reads from the muscle data matched to a given gene with a length of 2,000 bp.
Question Answer
26. Calculate RPKM for the muscle
27. Calculate RPKM for the liver
D2. Figure D describes relationship between 9 globin proteins. Calculate the following distances:
Question Answer
28. Distance between hbb-chimp & hbb-chicken =0.15+ 0.06+ 0.05+ 0.05=0.31
29. Distance between hbb-human & hbb-mouse =0.05+ 0.05+ 0.11= 0.21
30. Which groups in the NJ tree have Bootstrap There are 2 groups have Bootstrap value =
value = 100% 100%
+ Group of hbg2_chicken_cds and
hbb_cds including hbb_human_cds,
hbb_chimp_cds, hbb_dog_cds,
hbb_mouse_cds
+ Group of globin_soybean_cds and
globin_rice_cds
5
E. Performing ProtParam (https://web.expasy.org/protparam/) for the accession number P05130.
Complete the table below (15 points):
F. Accessing to NCBI and obtaining a sequence of the accession number P62166. Performing NetPhos
(http://www.cbs.dtu.dk/services/NetPhos/) for the sequence of P62166. Complete the table below (14
points):