[Gmod-help] Bio::DB::SeqFeature::Store issue
Ian Holmes
ihh at berkeley.edu
Thu Mar 19 14:43:34 EDT 2009
Hi Scott, thanks for helping out...
The "##gff-version 3" line didn't make any difference, but adding a
"##sequence-region ctgA 1 100000" line did help.
I was hoping to use this in an application that would be forgiving about
poorly-formed input data (specifically the TWiki plugin for JBrowse). I
guess I can quite easily sanitize the input GFF (i.e. fake a
##sequence-region line for every seqid)... but I'd like to check that
this is, indeed, necessary rather than duplicating effort..
Cheers,
Ian
Scott Cain wrote:
> Hi Ian,
>
> That is rather bizarre behavior you're describing. I have a few
> questions to get us started:
>
> Is there a line that describes what ctgA is (either a
> ##sequence-region line or a GFF line)?
>
> Does the file start with ##gff-version 3 ? I think the seqfeature
> loader is somewhat particular about this.
>
> Scott
>
>
> On Thu, Mar 19, 2009 at 2:29 PM, Ian Holmes <ihh at berkeley.edu> wrote:
>> Hi, not sure if GMOD is the right place to direct this query, apologies
>> if not.
>>
>> I'm having some problems with Bio::DB::SeqFeature::Store and I hoped
>> someone better-versed in the internals might be able to help.
>>
>> The following GFF file seems to work fine with the "features" method of
>> Bio::DB::SeqFeature::Store (NB there is a tab at the end of each line,
>> denoting an empty group field; I have also attached this as a GFF file)
>>
>> ctgA bed2gff feature 1659 1984 . . .
>> ctgA bed2gff feature 3014 6130 . . .
>> ctgA bed2gff feature 1 50000 . . .
>>
>> For example, the following Perl correctly calculates & prints the total
>> number of features in the file (3):
>>
>> use Bio::DB::SeqFeature::Store;
>> my $db = Bio::DB::SeqFeature::Store->new
>> (-adaptor => "memory",
>> -dsn => "testbed.gff");
>> my @features = $db->features (-seqid => "ctgA");
>> print @features+0;
>>
>>
>>
>> However, if I delete the last line of the GFF file (i.e. the feature
>> from 1 to 50000), it doesn't work (it thinks there are zero features in
>> the file). It also doesn't seem to mind if I change the coordinates of
>> the last feature, as long as the endpoint is vaguely "large" (e.g. 49000
>> works, but 7000 doesn't).
>>
>>
>> Anyone have any ideas? Is this something to do with binning and/or
>> default start/endpoints in the feature query method?
>>
>>
>> Ian
>>
>> ctgA bed2gff feature 1659 1984 . . .
>> ctgA bed2gff feature 3014 6130 . . .
>> ctgA bed2gff feature 1660 7000 . . .
>>
>>
>
>
>
More information about the Gmod-help
mailing list