![]() The languages, which help in bioinformatics, are C, C++, JAVA, FORTRAN, LINUX, UNIX etc. The biologists should also be familiar with key algorithms (set of steps). A biologist needs to know programming, optimization (code) and cluster analysis, as they are bioinformatics methods. To specialize in bioinformatics, knowledge of both biology and information computer technology is required. Y- Pyrimidines (thymidine, cytosine) U-UridineH - ACT R- Purines (guanine, adenine) T- Thymidine The nucleic acid codes as follows (FASTA format)Ī-adenosine B-GTC C-cytidine D-GATG-guanosine ![]() T- Threonine Z-Glutamate/glutamine X-any *-Translation stop ![]() P-Proline Q-Glutamine R-Arginine S- Serine I-Isoleucine M-Methionine K-Lysine N-asparagine The accepted amino acid codes for proteins are given below.Ī-Alanine B-aspartate / asparagines C-Cystine D-AspartateĮ-Glutamate F-Phenylalanine G-Glycine H-Histidine In the representation of either the nucleotides or the proteins, IUB/ IUPAC standards are followed. The cDNA libraries are smaller than genomic libraries and contain only DNA molecules for genes. In cDNA copies of messenger RNA are made by using reverse transcriptase enzymes. ![]() The DNA sequence data are then stored in a computer accessible form.Ī DNA library is a collection of DNA fragments, which contains all the sequences of a single organism. Nowadays automated sequencing machines called sequenators are developed to read hundreds of bases in the DNA. The sequence of bases in the DNA fragments can be identified by chemical / biochemical methods. By comparing the distances that the DNA fragments migrate, their number of bases could be distinguished. Smaller DNA fragments move faster than longer ones. When DNA of an organism is subjected to electrophoresis they migrate towards the positive electrode because DNA is a negatively charged molecule. The genome of an organism can be split up into different sized molecules by a technique called electrophoresis. SWISS-PROT groups at SIB (Swiss Institute of Bioinformatics) and EBI (European Bioinformatics Institue) have developed the protein sequence databases. The protein sequence databases elucidate the high level annotations such as the description of the protein functions their domain structure (configuration), amino acid sequence, post-translational GDB - Genome Database - Human.Įuropean Molecular Biology Laboratory (EMBL) National Centre for Biotechnology Information (NCBI) and DNA data bank of Japan ( DDBJ) are the three premier institutes considered as the authorities in the nucleotide sequence databases. Secondary databases also called as value added databases contain annotated data and information eg., OMIN - Online Mendelian Inheritance in Man. Primary databases contain the data in their original form taken as such from the source eg., Genebank (NCBI/USA) Protein, SWISS-PROT (Switzerland), Protein 3D structure etc. The nucleic acid databases are again classified into primary databases and secondary databases. Structure databases are the individual records of macromolecular structures. The former is the nucleic acid databases and the latter are the protein sequence databases.ī. Sequence databases are the sequence records of either nucleotides or amino acids. Generalized databases contain sequence database and structure databases.Ī. RNA databases are included under specialized data bases. Databases of Expressed Sequence Tags (ESTs), Genome Survey Sequences (GSS), Single Nucleotide Polymorphisms (SNPs) sequence Tagged sites (STSs). Structural organisation of DNA, protein, carbohydrates are included under generalized databases. Database is a general repository of voluminous information or records to be processed by a programme.ĭatabases are broadly classified as generalized databases and specialized databases. 'Creating' database means a coherent collection of data with inherent meaning, used for future application.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |