[Gmod-help] convert GFF files
Scott Cain
scott at scottcain.net
Mon Apr 20 11:07:29 EDT 2009
Hello Raul,
The sample you posted already looks very much like GFF3; is that the
sample you really meant to post?
The only thing I can see that makes this not GFF3 is that the item in
the third column (data_Day1_532:BLOCK1) is not a valid Sequence
Ontology type--and additionally, there is other data encoded in the
type which ought to go elsewhere (assuming it's important enough to
save). For example, I would dump the NimbleScan from the source
column and instead put what is in column 3 in column 2. Then for
column 3, I don't know exactly what you should put since I don't know
enough about your data, but I would guess "probe" would probably be
suitable, and if not probe, probably some other child of "reagent".
See
http://www.sequenceontology.org/miso/current_cvs/term/SO:0000051
for the Sequence Ontology term browser.
As for tools, there is no tool for converting between GFF and near-GFF
flavors: it is just too complicated a task for writing a generic tool.
However, given that the suggestions I just made are relatively
simple, you could probably do it in Excel or other spreadsheet
software and export as tab delimited text.
Scott
On Mon, Apr 20, 2009 at 10:48 AM, Petrin, Raul <raul-petrin at uiowa.edu> wrote:
> Hi
>
> I know that you got a lot of email regarding Convert GFF to GFF3, but I was
> reading but I didn’t found this full information.
>
> I have one file that I would like to convert to GFF3. Example.
>
>
>
> ##gff-version 3
>
> SCAFFOLD_1000 NimbleScan data_Day1_532:BLOCK1
> 11 61 1406.89 .
> .
> seq_id=SCAFFOLD_1000:1-3586;probe_id=SCAFFOLD_1000FS000000011;count=1
>
> SCAFFOLD_1000 NimbleScan data_Day1_532:BLOCK1
> 29 87 1286.67 .
> .
> seq_id=SCAFFOLD_1000:1-3586;probe_id=SCAFFOLD_1000FS000000029;count=1
>
> SCAFFOLD_1000 NimbleScan data_Day1_532:BLOCK1
> 62 118 1244.11 .
> .
> seq_id=SCAFFOLD_1000:1-3586;probe_id=SCAFFOLD_1000FS000000062;count=1
>
> SCAFFOLD_1000 NimbleScan data_Day1_532:BLOCK1
> 87 142 1321.22 .
> .
> seq_id=SCAFFOLD_1000:1-3586;probe_id=SCAFFOLD_1000FS000000087;count=1
>
> SCAFFOLD_1000 NimbleScan data_Day1_532:BLOCK1
> 114 168 1431.89 .
> .
> seq_id=SCAFFOLD_1000:1-3586;probe_id=SCAFFOLD_1000FS000000114;count=1
>
> SCAFFOLD_1000 NimbleScan data_Day1_532:BLOCK1
> 151 207 1098.56 .
> .
> seq_id=SCAFFOLD_1000:1-3586;probe_id=SCAFFOLD_1000FS000000151;count=1
>
> SCAFFOLD_1000 NimbleScan data_Day1_532:BLOCK1
> 179 241 1658.33 .
> .
> seq_id=SCAFFOLD_1000:1-3586;probe_id=SCAFFOLD_1000FS000000179;count=1
>
> SCAFFOLD_1000 NimbleScan data_Day1_532:BLOCK1
> 197 255 1747.22 .
> .
> seq_id=SCAFFOLD_1000:1-3586;probe_id=SCAFFOLD_1000FS000000197;count=1
>
> SCAFFOLD_1000 NimbleScan data_Day1_532:BLOCK1
> 225 282 1359.78 .
> .
> seq_id=SCAFFOLD_1000:1-3586;probe_id=SCAFFOLD_1000FS000000225;count=1
>
> SCAFFOLD_1000 NimbleScan data_Day1_532:BLOCK1
> 253 310 2027.00 .
> .
> seq_id=SCAFFOLD_1000:1-3586;probe_id=SCAFFOLD_1000FS000000253;count=1
>
> SCAFFOLD_1000 NimbleScan data_Day1_532:BLOCK1
> 291 350 1631.78 .
> .
> seq_id=SCAFFOLD_1000:1-3586;probe_id=SCAFFOLD_1000FS000000291;count=1
>
>
>
> Which tools can I use to convert?
>
> Thank you so much.
>
>
>
> Raul Petrin
>
> 2121 ML - 384-1015
>
> University of Iowa
>
>
>
> Notice: This UI Health Care e-mail (including attachments) is covered by the
> Electronic Communications Privacy Act, 18 U.S.C. 2510-2521, is confidential
> and may be legally privileged. If you are not the intended recipient, you
> are hereby notified that any retention, dissemination, distribution, or
> copying of this communication is strictly prohibited. Please reply to the
> sender that you have received the message in error, then delete it. Thank
> you.
--
------------------------------------------------------------------------
Scott Cain, Ph. D. scott at scottcain dot net
GMOD Coordinator (http://gmod.org/) 216-392-3087
Ontario Institute for Cancer Research
More information about the Gmod-help
mailing list