Course Structure: M. Sc. (Bioinformatics)
Department of Bioinformatics, Singhania University
| Course Code |
Course Title |
Internal Assessment |
Semester Exam |
Total |
| |
Semester I |
|
|
|
| MSBI111 |
Introduction of Bioinformatics |
30 |
70 |
100 |
| MSBI121 |
Statistics and Probability |
30 |
70 |
100 |
| MSBI131 |
Molecular Biology |
30 |
70 |
100 |
| MSBI141 |
Fundamental of Computer |
30 |
70 |
100 |
| MSBI151 |
Operating System (Windows, Linux) |
30 |
70 |
100 |
| MSBI161 |
Computer Lab I |
50 |
100 |
150 |
| |
Semester II |
|
|
|
| MSBI112 |
Object Oriented programming in C++ |
30 |
70 |
100 |
| MSBI122 |
Computational Method for Sequence Analysis |
30 |
70 |
100 |
| MSBI132 |
Molecular Imaging & Drug Designing |
30 |
70 |
100 |
| MSBI142 |
Database Management System & Data Mining |
30 |
70 |
100 |
| MSBI152 |
Genomics, Proteomics and HGP |
30 |
70 |
100 |
| MSBI162 |
Computer Lab II |
50 |
100 |
150 |
| |
Semester III |
|
|
|
| MSBI211 |
Perl programming for Bioinformatics |
30 |
70 |
100 |
| MSBI221 |
Computer Networks and Advanced Multimedia |
30 |
70 |
100 |
| MSBI231 |
Advanced Algorithm for Computational Biology |
30 |
70 |
100 |
| MSBI241 |
Gene Expression & MicroArray Analysis |
30 |
70 |
100 |
| MSBI251 |
Information Theory & Stochastic Process |
30 |
70 |
100 |
| MSBI261 |
Computer Lab III |
50 |
100 |
150 |
| |
Semester IV |
|
| MSBI212 |
Major Project |
100 |
300* |
|
400 |
| * (Project Report Evaluation: 150 & Comprehensive Vivavoce: 150) |
|
| Grand Total: |
2350 |
MSBI – 111: Introduction of Bioinformatics
The nature of chemical bonds, Introduction to Genes and Proteins, Nucleotides, Orientation, Base pairing, The central dogma, Promoter sequences, Genetic Code, ORFs, Introns and Exons, Slice variants, Protein structure, Primary, Secondary, Tertiary and Quaternary, The notation of homology. Introduction to Data Generating Techniques: Restriction Enzymes, Gel Electrophoresis, Blotting and Hybridization, Cloning, PCR. Biological databases, Search engines, Public databases: PubMED, EMBL, GenBanK, PDB, SwissPort. Genomics and Proteomics: Prokaryotic genomes, Eukaryotic Genomes, Gene Structure, GC Content in Eukaryotic genomes, Gene Expression, Protein Classification, 2D–Electrophoresis, Mass spectrometry, Microarray technology, X–ray crystallography, NMR, Sequence and Phylogeny Analysis, Detecting ORFs, Outline of sequence alignment, Introduction to BLAST, Multiple sequence alignment, Phylogenetic analysis.
Text / Reference Books:
- Fundamental Concepts of Bioinformatics, D.E. Krane and M.L. Raymer, Person Education, 2003.
- Bioinformatics Computing by B. Bergeron, Prentice –Hall, 2003.
- Bioinformatics: A practical guide to the analysis of genes and proteins, D. Baxivanis and F. Oulette, Wiely Indian Edition, 2001
- Bioinformatics: Sequence and Genome analysis, D.W. Mount, Cold Spring Harbour Laboratory Press, 2001
MSBI – 121: Statistics and Probability
Probability, Conditional Probability, Random Variables, Expected Value. Continuous and Discrete distributions: Binomial, Geometric, Poisson, Pascal, Uniform Exponential and Normal, Poison Process. Multidimensional random variables, Multidimensional and bivariate normal distribution, Moment generating functions. Law of large numbers, Central limit theorem,
Sampling distribution, Point and Internal Estimation Testing of hypothesis, Goodness of fit, Linear regression. Principal component analysis, Discriminant analysis, Cluster analysis, Multivariate analysis of variance.
Text / Reference Books:
- Statistical Methods in Bioinformatics: An Introduction by W. Evans & G. Grant, Springer, 2001
- First course in Probability by Sheldon Ross, PrenticeHall (India), 2001
MSBI – 131: Molecular Biology
Concepts in Biology: Characteristics of living organisms, Structure of cells, Energy flow; Nutrition and Metabolism; Information storage and expression, Reproduction and Inheritance. Biomolecules: Carbohydrates, Amino Acids, Proteins, Lipids, Nucleic acids. Enzymes; Properties, Activation energy, Reaction kinetics, Intercellular Communication, Flow of genetic information, DNA expression and replication. Genes; Changing size of Genes, Gene synthesis, Cloning (Plant and Animal), Mutation, Gene Targeting, Gene Therapy. Level of secondary structure, Basic concepts of molecular dynamics, Molecular simulation studies. Protein structure determination by Xray Diffraction, NMR, Introduction to nucleic acid structure, Genome organization, HGP. Text / Reference Books:
- Molecular Biology by David Friefelder, Jones & Burtlett Publishers inc, 2000
- Molecular Cell Biology by Alberts, Bruce Bray, Watson & James, Garland Publishing, 1999.
MSBI – 141: Fundamental of Computer
Digital Circuits and Computer Architecture: Transistors – and Integrated Circuits (LSI, VLSI); Operation of processor; Number Systems and Digital Circuits; ALU; Memory Chips, Storage Devices, Memory Hierarchy; I/O Devices; Bus Systems; Computer Organization and Architecture; Microprocessor; Moore’s Law, Operating Systems and DBMS: Windows and Linux; Basics of relational Databases; SQL. Programming Languages: Algorithms; Problem Solving Classification; Basics of C Language, Concepts of objects and classes, Introduction to objectoriented Programming. Computers Networks: Protocols; Wireless LAN and Mobile Computing. Introduction to Web Technology: HTML and Web pages; The Internet and Intranet; WWW; Java Script and Dynamic Web Pages; Multimedia Application; Computer Network Security, eCommerce.
Text/Reference Books:
- The Essential Guide to Computing: The story of Information Technology, E. Garrison Walters, PHI, 2001.
- A Balanced Introduction to Computer science by David Reed, PE, 2000.
- Introduction to Computing Systems: From Bits to Gates to C and Beyond. Yale N. Patt and Sanjay J. Patel. Prentice Hall India, 1999
MSBI – 151: Operating System Windows, Linux
Introductory concepts, Operating systems, Mainframe systems, Multiprocessors, Distributed systems, Real – Time systems, Computing environments. Computer system structure, Computer system organization, I/O structure, Storage structure, Storage hierarchy,
Hardware protection, Network structure. Operating system structures, System components, OS Services, System calls, System programs, System structure, Virtual machines Processes, Process concepts, Process scheduling, Operations on processes, Process states, Cooperating processes, Interprocess communication, Common OS problems. DOS and Windows, Introductory concepts, Architecture and special features, Useful DOS commands, DOS – Editor, EDIT & writing Batch Files, Different Versions and their Distinguishing features, Managing Operations with Windows. Linux: Introductory concepts, Architecture and special features, Useful Linux commands, Related operating system and their Introduction.
Text / Reference Books:
- Operating Systems by Glavin, Addison Wessely, 2000
- Modern Operating Systems by S. Tanenbaum, Prentice Hall (India), 1999
- Operating Systems Concept and Design by Milan Milenkovic, Tata McGraw Hill, 2000
MSBI – 161L Computer Lab I
Windows and Linux installation, user interface, File management, Data Security, Users management. Using Linux and Windows commands. Sequence Alignment Software: Searching for sequence databases; Protein databases, PDB, BLAST, FASTA, CLUSTALW
MSBI – 112: Object Oriented Programming in C++
Methodologies: Concepts of structured and ObjectOriented programming, Advantages of OOPs methodologies. Introduction to ObjectOriented programming: Characteristics of OOPs Languages, C++ programming Basics operators, Loops, Arrays, Pointers. Principle of ObjectOriented programming, Introduction to Abstraction, Encapsulation, Inline function, Reference variable, Access specifier, Information Hiding, Friend Functions, Friend Class. Operator overloading, Function overloading. Inheritance: Extending classes, Types of inheritance, Virtual base class, Problems with multiple inheritance, Containership. Advanced pointers, Virtual functions, Polymorphism, Pointers to Class, Array pointers to Class, Virtual FunctionAbstract Class, Pure Virtual Function, Files and Streams, Building class Libraries in C++. File Handling in C++, Template class, Exception Handling.
Text / Reference Book:
- Object Oriented Programming with C++, E. Balagurusamy, Tata McGraw Hill
- Object Oriented Programming in C++, Robert Lafore, Techmedia 2002
- C++, How to program by Deitel & Deitel, Person Edition, 2002
- The C++ Programming Languages by Stroustrup, Addison, Wesely., 2001
- Problem Solving the object of Programming, Walter Savitch, Pearson Edition, 2003
BTBI – 122: Computational Methods for Sequence Analysis
Introduction to sequence databases: DDBJ / EMBL / GenBank; Information retrieval from databases; Analysis of DNA and Protein sequences: distribution, pattern and motif searches, Segmentation. Sequence alignment: Scoring matrices, PAM, BLOSUM, Local and global alignment concepts; Dynamic programming; Needleman Wunch algorithm, Smith Waterman algorithm; Statistics of alignment score; Multiple sequence alignment. Database searches for homologous sequences: FASTA, BLAST and CLUSTALW, Comparison, Artificial Neural Network concepts, Preceptron, Multilayer preceptron, Secondary Structure Prediction using ANN. Evolutionary analysis: Distances, Clustering methods, Tree representation, Genome sequence assembly; Gene finding methods; Analysis and prediction of regulatory regions. Profiles and Hidden Markov Model, Gene identification and prediction, DNA Microarrys, Gene Expression, Clustering gene expression profiles.
Text / Reference Book:
- Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins, A. D. Baxevanis and B. F. F. Ouellette John Wiley and Sons (ASIA) Pvt. Ltd. 2002
- Bioinformatics methods and applications Genomes, Proteomics and Drug Discovery, S. C. Rastogi, N. Mendiratla and P. Rastogi, Prentice Hall (India), 2004
- BLAST by I. Korf, M. Yandell & Joseph Bedell, O’Reilly, 2003
- Biological Sequence Analysis – Probabilitie Models of Proteins and Nucleic acids, R. Durbin,
S. Eddy, A. Krogh, G. Mitchison, Cambridge University Press, 1988
MSBI – 132: Molecular Imaging and Drug Design
Overview of Drug design, Simulation Methods, Molecular Mechanics, Monte Carlo, SemiImpirical Method, Abinito Method. Analog based drug design: QSAR, Molecular Structure determination, Advance crystallographic techniques, Simulation Anneling (Autodock /Quantum), Peptide Sequencing, EST Clustering. Comparative Protein Modelling: Homology Modeling, Structural Alignment, Ramachandran plot (Side Chain Placement Refinement & Validation of Model), 3DPharmacophore modeling. Computeraided drug designing: Drug design, Lead discovery, ADMET in drug design, Lipinski ”rule of 5”, Principle and Methods of Docking, Ligand based drug designing, Target identification, Drug Metabolism, Denovo design of drug. Structure based drug design. Introduction of Drug discovery, Pharmacogenomics, Analysis of SNP’s, Drug discovery technologies and strategies.
Text / Reference Book:
- Molecular Modelling: Principle and Applications, Andrew R. Leach, Longman Singapore Publisher, 1999
- Bioinformatics: From Genome to Drugs (Vol. I and II), Thomas Lengaue, WileyVCH, 2000
- Structural Bioinformatics, P. E. Bourne and B.H. weissing, Wiley VCH, 1999
- Bioinformatics Method and ApplicationsGenomics, Proteomics and Drug Discovery, S.C. Rastogi, N. Mendiratla and P. Rastogi, Prentice Hall (India), 2004
MSBI – 142: DataBase Management System and Data Mining
Introduction, Characteristics of Data base approach, Database users, Intended uses of databases, Implication of database approach. Database System Concepts and Architecture, Data Models, Schemas bad Instances, DBMS Architecture and data independence, database languages, Classification of DBMS. Database Design: Informal Design guidelines for relation Schemas, functional Dependencies, Normal forms based on
nd rd
primary keys, general Definition of 2and 3Normal forms, BCNF, Need of further Normalization. Data Modeling using ER diagram, ER Model Concepts, Notation for ER Diagrams. The Relational Model, Relational Model Concepts, Relational Model Constraints, Introduction to Relational Algebra. Data Mining, Definition, Data Mining and KDD, Data Mining on relational databases, Data Warehouses, Transactional databases etc. Data Mining Functionalities, Pattern Mining, Data Mining on Biological data.
Text/Reference Books:
- Fundamentals of Database System by Elmasari and Navathe, Prentice Hall (India), 2001
- Data Mining Concepts and techniquesJiawei Han, Micheline Kamber, Morgan Kaufmann publisher, 2001
MSBI – 152: Genomics, Proteomics and HGP
Primer Selection Phylogeny, Protein Sequence Analysis, SAGE, Primer Chain Reaction (PCR). Genomics, Fragmenting the genomes, Need of markers, Marker Sequencing (RFLP, AFLP & SNP, etc); Genome sequencing: Basic Sanger sequencing, Automated sequence, Gene Identification, MUMmer , Blast2. Proteomics, Microarray: The technique of microarray design, 2D gel electrophoresis, Peptide sequencing, Mass spectroscopy. Metabolomics, Metabolic pathway databases: KEGG, COG, WIT; Drug target identification. Functional genomics: Application of sequence based and structure based approach of gene function eg: Sequence comparison, Structure analysis (active site, binding site) and comparison; Approaches for determining gene expression patterns and functions. What is HGP: Goals, Progress & History of HGP.
Text / Reference Book:
- Handbook of Comparative Genomics: Principles and Methodology by T.B. Kitano, Graziano Pesole, 2003.
- Computational Molecular Biology by P. Clate & R. Backofen, Willy Publication, 1998 Proteomics,
- Timothy Palkill, kluwer Academic Publisher, 2002
- Introduction to Proteomics: Tools for the new Biology, Daniel C. Liebler, Humana Press, 2001
-
Foundation to systems Biology (Ed) by Hiroaki Kitano, MIT Press, 2001
MSBI – 162: Computer Lab II
Problem solving using C++ programming language. Sequence Alignment Software: Searching forsequence databases; Protein databases, PDB, BLAST, FASTA, CLUSTW, Compairing and aligning sequences; Phylogenetic trees for sequences. Creating database for Biological projects including Schema, ER Diagram, and Normalization. Developing database applications: SQL queries using Oracle. Clustering and classification software such as SOM, SVM, PCA; using visualization software such as TreeView, Sammon Mapping.
MSBI – 211: Perl Programming for Bioinformatics
Getting started with PERL, Perl’s Benefits, Versions of Perl, Downloading and installing Perl in Linux/Windows environment, How to run perl programs in Linux/Windows environment, Error Message, Debugging, Perl interpreters, Perl scripts. Scalar values and scalar variables, Strings and numbers, Assignment statements, Blocks, Arrays, Hashes, Operators, Binding operators, Loops, IO, Input from file, Input from SDIN, Input from file named in command line, Regular expression, Pattern matching, Meta symbols, Pattern modifiers, Subroutines and modules, Builtin functions. Application of Perl in Bioinformatics: Representing strings and sequenced data in Perl, Program to store a DNA sequence, Concatenating DNA fragments, DNA to RNA transcription, Reading proteins from files, Finding motifs, Counting nucleotides, Exploding strings into arrays, Operating on strings. A program to simulate DNA mutation, Generating random DNA, Analyzing DNA, Translating DNA into proteins, GenBank sequence and annotation, Parsing annotation, Indexing GenBank with database. PDB files, Parsing PDB files, Parsing BLAST output, Introduction to BioPerl, CPAN, and Python.
Text / Reference Book:
- James D.Tisdall, “Beginning Perl for Bioinformatics”, O’Rilley and Associates, 2001.
- Cynthia Gibas & Per Jamesbeck, “Developing ioinformatics Compuer skills”, O’ Rilley & Associates, 2000
- Rex A.Dawyer, “Genomic Perl”, Cambridge University Press.
rd
4. Learning Perl, 3Edition: Randal L.Schawrtz and Tom Phoneix, O’Rilley
MSBI – 221: Computer Networks and Advanced Multimedia
Basic concepts of Computer Networks, Networks Hardware, Networks Software, ISOOSI reference models, Protocols. The Physical and Data Link Layer, Different transmission media, Data Link Layer Design Issues, Error Detection and Correction. The Network Layer, Network Layer Design Issues, Shortest Path Routing, IP Protocols, IP Address. The Transport Layer, The Transport Service, Services Providing to the Upper Layers, Transport Services Primitives, Internet Programming ToolsHTML, DHTML, JavaScript: Designing and developing web pages using HTML/DHTML. Overview of Multimedia: Different Views and Perspectives; Multimedia Applications: Videoondemand, Interactive television, Video conferencing, Hypermedia courseware, Groupware, World Wide Web, Digital libraries; Media and its Types: Static, Continuous, TimeBased.; Visual Communication and Human Perception: visual and auditory; Multimedia Devices and Systems: Audio/Video boards, Support for Media, CMS services; User Interface Design: Need, Significance, Guidelines etc. Multimedia Development: Phases, Stages, Team etc, Multimedia Building Blocks, Text: Value and significance, Formats, style, use in MM Networking and Applications, Streaming Audio and Video, Best Effort Service, Protocols for Real Time Applications, RTP, RTCP, SIP etc.; Graphics: Significance, different Image Formats, Scheduling Policing Mechanism, Integrated services and Differentiated services
Text / Reference Books:
th
- Computer Networks by A.S. Tanenbaum, 4Edition, Pearson Education, 2000
- Multimedia – An Introduction by Villmil:, Prentice Hall (India), 1999
nd
3. Computer Networking by J.F. Kurose and Keith W. Ross, 2Edition, Pearson Education, 2001
MSBI – 231: Advanced Algorithm for Computational Biology
Algorithms and Complexity, Algorithm Design Techniques, Tractable versus Intractable Problems. Restriction Mapping, Graphs, Measuring Fragment Size, Restriction Mapping Algorithms, Double Digest Problem, Classifying multiple solutions, NPCompleteness, Regulatory motifs in DNA sequences, Profiles, Motif finding Greedy algorithms, Divide and Conquer Algorithms. Markov Chains, Hidden Markov Models, CGIslands, Forward & Backward Algorithm, HMM Parameter estimation, Profile HMM, Building viterbi Algorithm, Multiple Sequence Alignment by profile HMM’s. DNA Sequencing, Shortest Superstring Problem, DNA Arrays as an Alternative Sequencing Technique, Sequencing by Hybridization, SBH as a Hamiltonian Path Problem, SBH as an Eulerian Path Problem, Fragment assembly in DNA Sequencing. Phylogenetic Reconstruction, Gene Expression Analysis, Hierarchical Clustering, kMean Clustering, Distance Based Tree Reconstruction, Aditive Metrics, Metrics on Tree, Ultrametric Trees, CharacterBased Tree Reconstruction, Parsimony Method, Evolutionary Methods, Maximum Likelihood Method, Model Comparison. Protein Folding: Levels of Proteins Structures, Prediction by Profile HMMs, Molecular Modeling.
Text / Reference Book:
- An Introduction to Bioinformatics Algorithms, Neil C Jones & Pavel A. Pevzner, Ane Books, 2005.
- Introduction to Computational Molecular Biology , Setubal and Meidanis, Thomson, 2003.
- Biological Sequence Analysis Probablistc Models of Proteins and Nucleic Acid, R. Durbin, Eady, A. Krogh and Mitchison, Cambridge University Press, 1998.
- Introduction to Mathematical Method in Bioinformatics, Alexcender Isaev, Springer Universitext, 2004
MSBI – 241: Gene Expression & MicroArray Analysis
DNA MicroArray: The Technical Foundations, Why are MicroArray Important? What is a DNA MicroArray. Designing a MicroArray Experiment: The Basic steps, Types of MicroArray, NCBI and MicroArray Data Management, GEO (Gene Expression Omnibus), MAML, The benefits of GEO and MAML, The Promise of MicroArray Technology in Treating Disease. MicroArray Data, Preprocessing the Data, Measuring Dissimilarity of Expression Pattern, Distance Motifs and Dissimilarity measures, Visualizing MicroArray Data; Principal Component Analysis, PCA and MicroArray Data. KMeans Clustering, Hierarchical Clustering, Self Organization Maps (SOM). Identifying Genes: Expressed usually in a sample, Expressed significantly in population, Expressed differently in two populations. Classifying Samples from two populations using Multilayer Perceptron, Support Vector Machines and their applications, Using genetic algorithm and perceptron for feature selection and supervised classification.
Text/Reference Books:w
- Microarray Data Analysis and Visualization by Arun Jogota, Bioinformatics, The Bay Press, 2001
- NCBI Science Premier on MicroArrays.
- Bioinformatics Methods and ApplicationGenomics, Proteomics and Drug Discovery by S. C. Rostogi, N. Mendiratla and P. Rostogi, Prentice Hall (India), 2004
MSBI251: Information Theory & Stochastic Process
Overview: Probabilities defined on events, Random variable, Binomial, Geometric, Poisson variates, Uniform, Exponential, Gamma, Normal Variates, Expectation. Introduction to stochastic process, Stationary process Poisson process, Nonhomogeneous Poisson process, Brownian motion process. Markov Chain: ChapmanKolmogorov equation, Classification of states, Limiting Probabilities, Time Reversible Markov Chain, Morkov Chain Monte Carlo Method, Continuous Time Marko Chain, Birth and Death Process, Time reversibility, Uniformization. Information and Entropy, Joint and Conditional Entropies, Mutual information, Chain rules for entropy, Relative entropy. Entropy rules of stochastic process, Entropy rate, Hidden Markov Models. Channel Capacity, Noiseless binary channel, Symmetric, Gaussian channel.
Text / Reference Book:
- Element of Information Theory by Cover and Thomas, New York: Wiley, 1991.
- A First Course in Stochastic Process by Samuel Karlin and Howard M. Taylor, Academic Press Inc. (Londan) Ltd., 1988.
- Stochastic Processes by Sheldon M. Ross, John Wiley & Sons, 2000. An Introduction to Stochastic Processes by Edward P. C. Kao, Duxbury, 1999
MSBI – 261: Computer Lab III
Perl programming for Motif search, Open reading frames, Parsing, etc. Signing and Creating Web pages using: HTML, Java Script. Multimedia: Introduction to Photoshop, Developing multimedia application using Flash. Gene expression and MicroArray: Preprocessing of datasets; using clustering and classification software such as SOM, SVM, PCA using visualization Software such as Tree View, Sammon Mapping
MSBI – 212: Major Project:
Project Guideline
- Students must submit his/her project synopsis to the concern supervisor along with his/her Name, Father’s Name, ID Number, Title of Project, etc and get it approved by them.
- Project must be done by individually or in group of maximum three students.
- Approval by the supervisor is based on the directions given by the Department.
- After scrutinizing the same by the supervisor, students must complete their project and submit the final report within the stipulated time in the form of soft and hard copy to the Department.
- The Department will evaluate project.
- Students must demonstrate their project in their Department to the concern persons on the date fixed by the learning center.
Objectives of the project proposed:
- To provide the scope to apply the concept of Bioinformatics, computer programming language, database used and application of computer science in biology.
- To analyze biological problems and provides good exposure to development & implementation of software database by following the software engineering concepts.
Marks Distribution:
• Internal Assessment: 100 Marks Project Supervisor: 40
Presentation & Viva Voce Internal 40 + 20 = 60
Total: 100
• Semester Assessment: 300 Marks Presentation & Viva Voce Final: 150 + 50 = 200 Project Report:
100 .. Total: 400
|