<HTML>
<HEAD>
<TITLE>Re: TAIR9 vs TAIR10 - mixup on naming, was: RE: Old versions?</TITLE>
</HEAD>
<BODY>
<FONT FACE="Calibri, Verdana, Helvetica, Arial"><SPAN STYLE='font-size:11pt'>Okay, it sounds like gramene needs to correct this. <BR>
<BR>
Probably the person at gramene who downloaded the Arabidopsis data wasn’t aware that TAIR10 is really the same assembly as TAIR9. A lot of people get confused about this. Or, they might have checked the actual data and discovered a difference, despite what the documentation says.<BR>
<BR>
Jonathan, since you are already providing validation services, could you also provide some additional sanity checking of the data?<BR>
<BR>
In this case, you could write something that determines whether two reference assemblies with different names are in fact the same.<BR>
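<BR>
A minimal sketch of one way to do this, assuming both assemblies are available as FASTA text: compute a length and MD5 checksum for each sequence and compare the two sets, ignoring the sequence names (since the names are exactly what may differ). The function names here (parse_fasta, sequence_digests, same_assembly) are made up for illustration and are not part of any existing DAS tool.<BR>
<PRE>
```python
import hashlib


def parse_fasta(text):
    """Parse FASTA-formatted text into a dict of sequence name to residues."""
    chunks = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first word of the header line as the sequence name.
            fields = line[1:].split()
            name = fields[0] if fields else ""
            chunks[name] = []
        elif name is not None:
            chunks[name].append(line)
    return {n: "".join(parts) for n, parts in chunks.items()}


def sequence_digests(text):
    """Map each sequence name to (length, MD5 of the upper-cased residues)."""
    digests = {}
    for name, seq in parse_fasta(text).items():
        # Assumption: case is ignored so soft-masking does not affect the result.
        digest = hashlib.md5(seq.upper().encode("ascii")).hexdigest()
        digests[name] = (len(seq), digest)
    return digests


def same_assembly(fasta_a, fasta_b):
    """True if both inputs hold the same set of sequences, whatever their names."""
    checks_a = sorted(sequence_digests(fasta_a).values())
    checks_b = sorted(sequence_digests(fasta_b).values())
    return checks_a == checks_b
```
</PRE>
Because the residues are upper-cased before hashing, soft-masked (lower-case) copies would still count as the same assembly; hard-masked copies (repeats replaced with N) would not.<BR>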
<BR>
In addition, you could write something else that checks that two DAS sources that serve the same genome sequence assemblies are indeed delivering the same data. Confusion could easily arise if one source is delivering masked sequence data but another one isn’t. <BR>
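<BR>
For this second check, one rough approach would be to request the same region from each source and classify how the returned sequences differ. The sketch below assumes the two sequences have already been fetched as plain strings; compare_segments is a hypothetical helper, and treating any mismatch where either side has an N as hard-masking is a simplifying assumption.<BR>
<PRE>
```python
def compare_segments(seq_a, seq_b):
    """Classify how two sequences reported for the same region differ.

    Returns one of: "identical", "length_mismatch", "soft_masked",
    "hard_masked", "different".
    """
    if seq_a == seq_b:
        return "identical"
    if len(seq_a) != len(seq_b):
        return "length_mismatch"
    upper_a = seq_a.upper()
    upper_b = seq_b.upper()
    if upper_a == upper_b:
        # Only the letter case differs: one source is soft-masked.
        return "soft_masked"
    for base_a, base_b in zip(upper_a, upper_b):
        # Assumption: a mismatch involving N is taken to be hard-masking.
        if base_a != base_b and "N" not in (base_a, base_b):
            return "different"
    return "hard_masked"
```
</PRE>
A result of "different" or "length_mismatch" would suggest the two sources are not serving the same assembly, while the two masked results would point to the kind of confusion described above.<BR>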
<BR>
Best,<BR>
<BR>
Ann<BR>
<BR>
On 6/28/11 12:13 PM, "Jonathan Warren" &lt;<a href="mailto:jw12@sanger.ac.uk">jw12@sanger.ac.uk</a>&gt; wrote:<BR>
<BR>
</SPAN></FONT><BLOCKQUOTE><FONT FACE="Calibri, Verdana, Helvetica, Arial"><SPAN STYLE='font-size:11pt'>Hi<BR>
<BR>
The DAS Registry automatically takes its information from the Gramene DAS sources document <a href="http://dev.gramene.org/gramenedas/das/sources">http://dev.gramene.org/gramenedas/das/sources</a>. If the reference coordinates are the same, then the coordinate system should remain TAIR 9. However, if new sequences have been added, then new entry_points need to be added to the DAS reference sources. If the Gramene sources document reverts to TAIR 9, the registry will automatically reflect this. As all the data sources using the TAIR 10 coordinate system are Gramene's, no other problems should arise.<BR>
<BR>
On 28 Jun 2011, at 15:26, Loraine, Ann wrote:<BR>
<BR>
</SPAN></FONT><BLOCKQUOTE><FONT FACE="Calibri, Verdana, Helvetica, Arial"><SPAN STYLE='font-size:11pt'> <BR>
<BR>
<BR>
Greetings all,<BR>
<BR>
There is some confusion about the meaning of TAIR9 versus TAIR10.<BR>
<BR>
TAIR9 is both a genome assembly release and a genome annotation release, meaning: it includes both new sequences for Arabidopsis chromosomes and some revised gene models.<BR>
<BR>
TAIR10 is a genome annotation release only. The chromosomes did not change from TAIR9 to TAIR10 according to this README file:<BR>
<BR>
<a href="ftp://ftp.arabidopsis.org/home/tair/Sequences/whole_chromosomes/README_whole_chromosomes.txt">ftp://ftp.arabidopsis.org/home/tair/Sequences/whole_chromosomes/README_whole_chromosomes.txt</a><BR>
<BR>
"Please note that the chromosome files have NOT CHANGED FROM TAIR9 to TAIR10"<BR>
<BR>
Thus, the gene structure annotations released in TAIR10 are using the same reference sequence as the gene structure annotations released with TAIR9.<BR>
<BR>
I've noticed the DAS registry contains both TAIR9 and TAIR10 as reference assemblies and that some data sets (they look like alignments) are referencing TAIR10 chromosomes. This is incorrect, as there is no TAIR10 genome assembly.<BR>
<BR>
Also, I would like to suggest using a different term for the TAIR9 assembly in order to avoid future confusion. Please use the term: A_thaliana_Jun_2009. This is what we are using to designate this genome assembly in the Integrated Genome Browser QuickLoad and DAS systems. It would be great if the DAS registry could either recognize this as a synonym for TAIR9 or use this term instead so that people will not continue to be confused about the meaning of TAIR*.<BR>
<BR>
Best wishes,<BR>
<BR>
Ann Loraine<BR>
____________________<BR>
Ann Loraine<BR>
Associate Professor<BR>
Dept. of Bioinformatics and Genomics, UNCC<BR>
North Carolina Research Campus<BR>
600 Laureate Way<BR>
Kannapolis, NC 28081<BR>
704-250-5750<BR>
<a href="http://www.transvar.org">www.transvar.org</a><BR>
<BR>
<BR>
<BR>
-----Original Message-----<BR>
From: Jonathan Warren [<a href="mailto:jw12@sanger.ac.uk">mailto:jw12@sanger.ac.uk</a>]<BR>
Sent: Tue 6/28/2011 8:46 AM<BR>
To: <a href="mailto:gramene@gramene.org">gramene@gramene.org</a><BR>
Subject: Old versions?<BR>
<BR>
Hi<BR>
<BR>
Do you still host old versions of Gramene? If so, where are they?<BR>
More specifically, the DAS sources for, say, TAIR 9...7 etc. rather than <BR>
TAIR 10? If they exist, I can register them with the DAS Registry, and <BR>
they may be useful for the DAS community and researchers.<BR>
<BR>
Thanks in advance<BR>
<BR>
Jonathan Warren<BR>
Senior Developer and DAS coordinator<BR>
blog: <a href="http://biodasman.wordpress.com/">http://biodasman.wordpress.com/</a><BR>
<a href="mailto:jw12@sanger.ac.uk">jw12@sanger.ac.uk</a><BR>
Ext: 2314<BR>
Telephone: 01223 492314<BR>
<BR>
--<BR>
The Wellcome Trust Sanger Institute is operated by Genome Research<BR>
Limited, a charity registered in England with number 1021457 and a<BR>
company registered in England with number 2742969, whose registered<BR>
office is 215 Euston Road, London, NW1 2BE.<BR>
<BR>
<BR>
<BR>
<BR>
</SPAN></FONT></BLOCKQUOTE><FONT FACE="Calibri, Verdana, Helvetica, Arial"><SPAN STYLE='font-size:11pt'><BR>
<BR>
</SPAN></FONT><FONT SIZE="1"><FONT FACE="Helvetica, Verdana, Arial"><SPAN STYLE='font-size:9pt'>Jonathan Warren<BR>
Senior Developer and DAS coordinator<BR>
blog: <a href="http://biodasman.wordpress.com/">http://biodasman.wordpress.com/</a><BR>
<a href="mailto:jw12@sanger.ac.uk">jw12@sanger.ac.uk</a><BR>
Ext: 2314<BR>
Telephone: 01223 492314<BR>
<BR>
<BR>
<BR>
</SPAN></FONT></FONT><FONT FACE="Helvetica, Verdana, Arial"><SPAN STYLE='font-size:12pt'><BR>
</SPAN></FONT><FONT FACE="Calibri, Verdana, Helvetica, Arial"><SPAN STYLE='font-size:11pt'><BR>
</SPAN></FONT></BLOCKQUOTE><FONT FACE="Calibri, Verdana, Helvetica, Arial"><SPAN STYLE='font-size:11pt'><BR>
-- <BR>
Ann Loraine<BR>
Associate Professor<BR>
Dept. of Bioinformatics and Genomics, UNCC<BR>
North Carolina Research Campus<BR>
600 Laureate Way<BR>
Kannapolis, NC 28081<BR>
704-250-5750 (office)<BR>
<a href="http://www.transvar.org">http://www.transvar.org</a><BR>
<BR>
</SPAN></FONT>
</BODY>
</HTML>