Missing features in Gramene release 19 data

Gregory Brown gregorysbrown at mac.com
Wed Jan 25 16:02:47 EST 2006


Hi,

It appears to me that all Rice "BAC", and "Gene Model" data  and some  
"Interpolated Gene", "mmp-unigene", "Maize EST" data is missing from  
both the mysqldump and tabbed data files for CMap release 19. Doing a  
search for all features that are 'bacs' or 'gene-model' from the CMap  
UI returns 0 entries when the same search at Gramene.org Map page  
returns 3438 and 3915 features repectively.  Looking in the Mysql  
database directly there are no entries that correspond to features  
that are 'bac' oe 'int-phen'and then searching either the  
cmap19.mysqldump or tabbed data files (cmap19-txt) show that there  
are no data fields for those feature types.


Would it be possible to re-export the CMap data either completely, or  
just the cmap_feature table?  Thanks.


mysql> SELECT * FROM cmap_feature WHERE feature_type_acc="gene-model"  
OR feature_type_acc = "bac";
Empty set (2.92 sec)

mysql> SELECT count(*) FROM cmap_feature WHERE feature_type_acc =  
"int-phen";
+----------+
| count(*) |
+----------+
|      710 |  There should be 3915 features of this type.
+----------+

mysql> SELECT count(*) FROM cmap_feature WHERE feature_type_acc =  
"mmp-unigene";
+----------+
| count(*) |
+----------+
|    32256 |  There are 39150 mmp-unigene features at Gramene.org
+----------+

mysql> SELECT count(*) FROM cmap_feature WHERE feature_type_acc =  
"maize-est";
+----------+
| count(*) |
+----------+
|     5052 |  There are 191,819 Maize EST features at Gramene.org
+----------+


< 47 > awk '{print $9}' cmap_feature.txt | egrep -c "^bac"
0
< 49 > awk '{print $9}' cmap_feature.txt | egrep -c "gene-model"
0

Greg Brown








More information about the Gramene mailing list