Skip to main content

Home/ OpenSciInfo/ Group items tagged sequence

Rss Feed Group items tagged

Mike Chelen

http://vancouvershortr.sourceforge.net/ - 0 views

  •  
    This package of tools encompasses many of the common pieces of software required for the analysis of short read sequences produced by the Second Generation DNA sequencing machines. (eg. Illumina/Solexa sequencers, ABI SOLiD and 454). The focus of this project is on post-alignment analysis, thus the input for this process should be the files produced by sequence aligners such as MAQ, Eland or Exonerate. The output should be provided in several formats, including BED and WIG files which are readable by the UCSC Genome Browser.
Mike Chelen

EST clusters - 0 views

  •  
    We build here a repository of assembled transcript sequences from the contigation (Expressed Sequence Tag (known as EST) & mRNA) in order to discover new genes from already existing data. Publicly available EST & mRNA sequences are clusterised and then contigated with specific bioinformatic tools (see technology).
Mike Chelen

SourceForge.net: CloudBurst - cloudburst-bio - 0 views

  •  
    CloudBurst: Highly Sensitive Short Read Mapping with MapReduce Michael Schatz Center for Bioinformatics and Computational Biology, University of Maryland Next-generation DNA sequencing machines are generating an enormous amount of sequence data, placing unprecedented demands on traditional single-processor read mapping algorithms. CloudBurst is a new parallel read-mapping algorithm optimized for mapping next-generation sequence data to the human genome and other reference genomes, for use in a variety of biological analyses including SNP discovery, genotyping, and personal genomics. It is modeled after the short read mapping program RMAP, and reports either all alignments or the unambiguous best alignment for each read with any number of mismatches or differences. This level of sensitivity could be prohibitively time consuming, but CloudBurst uses the open-source Hadoop implementation of MapReduce to parallelize execution using multiple compute nodes. CloudBurst's running time scales linearly with the number of reads mapped, and with near linear speedup as the number of processors increases. In a 24-processor core configuration, CloudBurst is up to 30 times faster than RMAP executing on a single core, while computing an identical set of alignments. In a large remote compute clouds with 96 cores, CloudBurst reduces the running time from hours to mere minutes for typical jobs involving mapping of millions of short reads to the human genome. CloudBurst is available open-source as a model for parallelizing other bioinformatics algorithms with MapReduce.
Mike Chelen

SourceForge.net: Vancouver Short Read Analysis Package - 0 views

  •  
    This package contains code for use with Short Read DNA Sequencing technologies, and includes packages for ChIP-Seq, Whole Transcriptome Shotgun Sequencing, Whole Genome Shotgun Sequencing, SNP Detection, Transcript expression and file conversion.
Mike Chelen

Bowtie: An ultrafast, memory-efficient short read aligner - 0 views

  •  
    Bowtie is an ultrafast, memory-efficient short read aligner. It aligns short DNA sequences (reads) to the human genome at a rate of 25 million reads per hour on a typical workstation with 2 gigabytes of memory. Bowtie indexes the genome with a Burrows-Wheeler index to keep its memory footprint small: 1.3 GB for the human genome. It supports alignment policies equivalent to Maq and SOAP but is much faster: about 35x faster than Maq and over 350x faster than SOAP when aligning to the human genome.
Mike Chelen

FindPeaks 3.1: a tool for identifying areas of enrichment from massively parallel short... - 0 views

  •  
    FindPeaks 3.1: a tool for identifying areas of enrichment from massively parallel short-read sequencing technology
Mike Chelen

SourceForge.net: Staden Package - 0 views

  •  
    A fully developed set of DNA sequence assembly (Gap4), editing and analysis tools (Spin) for Unix, Linux, MacOSX and MS Windows.
Mike Chelen

Bioinformatics Toolkit - 0 views

shared by Mike Chelen on 17 Dec 08 - Cached
  •  
    The Bioinformatics Toolkit is a platform that integrates a great variety of tools for protein sequence analysis. Many tools are developed in-house, and serveral public tools are offered with extended functionality. The toolkit includes, among others: NucleotideBLAST, ProteinBLAST, PSI-BLAST, fastHMMER, HHsenser; ClustalW, MUSCLE, Mafft, ProbCons; HHrep, PCOILS, REPPER; Quick2D; HHpred, Modeller; CLANS, ANCESCON, PHYLIP; Reformat, RetrieveSeq, gi2promoter. For a short description of the tools, click the section tabs.
Mike Chelen

UCSF Chimera Home Page - 0 views

shared by Mike Chelen on 11 Dec 08 - Cached
  •  
    UCSF Chimera is a highly extensible program for interactive visualization and analysis of molecular structures and related data, including density maps, supramolecular assemblies, sequence alignments, docking results, trajectories, and conformational ensembles. High-quality images and animations can be generated. Chimera includes complete documentation and several tutorials, and can be downloaded free of charge for academic, government, non-profit, and personal use. Chimera is developed by the Resource for Biocomputing, Visualization, and Informatics and funded by the NIH National Center for Research Resources (grant P41-RR01081).
Mike Chelen

Science 2.0 - introduction and perspectives for Poland « Freelancing science - 0 views

  • transcript of Science 2.0 based on a presentation I gave on conference on open science organized in Warsaw earlier this month
  • prepared for mixed audience and focused on perspectives for Poland
  • new forms of communication between scientists
  • ...44 more annotations...
  • research become meaningful only after confronting results with the scientific community
  • peer-reviewed publication is the best communication channel we had so far
  • new communication channels complement peer-reviewed publication
  • two important attributes in which they differ from traditional models: openness and communication time
  • increased openness and shorter communication time happens already in publishing industry (via Open Access movement and experiments with alternative/shorter ways of peer-review)
  • say few words about experiments that go little or quite a lot beyond publication
  • My Experiment as an example of an important step towards openness
  • least radical idea you can find in modern Science 2.0 world
  • virtual research environment
  • focus is put on sharing scientific workflows
  • use case
  • diagram of the “methods” sections from experimental (including bioinformatics analyses) publications
  • make it easier for others to understand what we did
  • can open towards other scientists we can also open towards non-experts
  • people from all over the world compete in improving structural models of proteins
  • helps in improving protein structure prediction software and in understanding protein folding
  • combine teaching and data annotation
  • metagenome sequences in first case and chemistry spectra in the second
  • interactive visualizations of chemical structures, genomes, proteins or multidimensional data
  • communicate some difficult concepts faster
  • new approaches in conference reporting
  • report in real time from the conference
  • followed by a number of people, including even the ones that were already on the conference
  • “open notebook science” which means conducting research using publicly available, immediately updated laboratory notebook
  • The reason I did a model for Cameron’s grant was that I subscribed to his feed before
  • I didn’t subscribe to Cameron because I knew his professional profile
  • I read his blog, I commented on it and he commented on mine, etc.
  • participation in online communities
  • important part of Science 2.0 is the fact that it has human face
  • PhDs about the same time
  • first was from a major Polish institute, the second from a major European one
  • what a head of a lab both would apply to will see
  • gap we must fill, this is between current research and lectures we give today
  • access to real-time scientific conversation
  • follow current research and decide what is important to learn
  • synthetic biology
  • not all universities in world have synthetic biology courses
  • didn’t stop these students, and they plan to participate in IGEM again
  • not only scientists – there are librarians, science communicators, editors from scientific journals, people working in biotech industry
  • community of life scientists
  • even people without direct connection to science
  • diverse skills and background
  • online conference
  • interact with them and to learn from them
1 - 11 of 11
Showing 20 items per page