Identification of novel regulatory elements in sequenced genomes by clustering and other data mining methods

Cancella i cookie per la scelta della lingua

Ricerca

Ricerca semplice

Ricerca avanzata

Ultime accessioni

Scorri per

Autore

Settori scientifico-disciplinari

Anno

Tipologia prodotto

Accessibilità del full-text

Informazioni

Policy

Informazioni su fedOA

FAQ

Contatti

Cozzuto, Luca (2009) Identification of novel regulatory elements in sequenced genomes by clustering and other data mining methods. [Tesi di dottorato] (Inedito)

Anteprima

PDF
Cozzuto_Luca.pdf
Download (9MB) | Anteprima

Tipologia del documento:	Tesi di dottorato
Lingua:	English
Titolo:	Identification of novel regulatory elements in sequenced genomes by clustering and other data mining methods
Autori:	Autore Email Cozzuto, Luca cozzuto@ceinge.unina.it
Data:	31 Marzo 2009
Numero di pagine:	107
Istituzione:	Università degli Studi di Napoli Federico II
Istituzioni (extra):	CEINGE Biotecnologie Avanzate, TIGEM – Telethon Insitute of Genetics and Medicine
Dipartimento:	CEINGE Biotecnologie avanzate
Scuola di dottorato:	SEMM – European School of Molecular Medicine
Dottorato:	PhD in Molecular Medicine (Molecular Oncology or Human Genetics)
Ciclo di dottorato:	20
Coordinatore del Corso di dottorato:	nome email Salvatore, Francesco [non definito]
Tutor:	nome email Salvatore, Francesco salvator@unina.it Paolella, Giovanni paolella@ceinge.unina.it Gibson, Toby [non definito]
Data:	31 Marzo 2009
Numero di pagine:	107
Parole chiave:	stem-loop, secondary structure prediction, RNA
Settori scientifico-disciplinari del MIUR:	Area 05 - Scienze biologiche > BIO/11 - Biologia molecolare
Informazioni aggiuntive:	Ciclo II/XX, Curriculum Molecular Oncology
Depositato il:	17 Nov 2009 09:26
Ultima modifica:	14 Gen 2015 12:28
URI:	http://www.fedoa.unina.it/id/eprint/3406
DOI:	10.6092/UNINA/FEDOA/3406

Abstract

In bacterial genomes a fraction of transcribed sequences do not code for proteins or structural RNAs, but have been shown to be involved in fundamental processes, such as regulation of gene expression, mRNA processing and stability or structural RNA maturation. In this thesis a systematic procedure to identify and classify families of repeated sequences sharing a common RNA secondary structure was applied to the study of 40 bacterial genomes. Sequences able to fold in a stable stem loop structure were clustered according to sequence similarity, and grouped within homogeneous families. The study led to the identification of 57 families of repeated sequences, sharing a common secondary structure and potentially coding for structured RNAs. All previously known such families have been detected by the used procedure, and are listed within the final set, together with 37 novel ones. Their location in relation to protein coding genes was evaluated, and a correlation was found between structure and positioning within intergenic regions. A new software tool is also described, Scaffolder, designed to help in high-throughput de novo genome sequencing by finding connections between contigs produced by random shotgun sequencing, and assisting the researcher in the whole process. The software, accessible both as a command line tool and as a web application, can guide all the final phases of genome assembly by storing the current assembly status, displaying networks of connected contigs and untangling multiply connected ones by a combination of computational and experimental procedures.

Downloads

Downloads per month over past year

Actions (login required)

Modifica documento

Università di Napoli - Centro di Ateneo per le Biblioteche

fedOA è realizzato con