[Gmod-help] GMOD Membership of the GNPAnnot Chado Controller ?
Stéphanie Sidibé--Bocs
stephanie.sidibe-bocs at cirad.fr
Tue Jan 3 18:25:48 EST 2012
Dear Scott,
In the framework of the GNPAnnot project (2008-2011), we have been
developing a component based on Chado PostgreSQL database schema: the
Chado Controller (CC). This PgSQL layer was presented at the January
2010 GMOD Meeting:
http://gmod.org/w/images/5/54/Jan2010GnpAnnotURGI.pdf
Our manuscript is under review (Guignon, Droc et al. Revised in 2011,
BIOINF-2011-1444) (here enclosed and confidential).
We would like to nominate the CC for a new GMOD software component
indeed we think it meets all the requirements.
The CC meets common needs in community annotation systems (CAS). It
allows three types of genomic feature monitoring:
1) Access restriction to ensure the confidentiality of predefined
scaffolds and features to the annotators;
2) Manual annotation Inspector to take advantage of controlled
vocabularies and control the work of the curator when submitting
annotation modifications through a genome editor and
3) Annotation history to ensure the monitoring of annotation history for
any modified feature.
It is used on the banana complete genome sequenced with NGS.
The need to monitor genomic feature is a common and longstanding need
that will outlive any particular CAS. So, the CC is going to be useful
over time. Indeed, the amount of sequenced genomes increase
exponentially. Thus, we are going to manage more and more genomic
features that need to be kept of quality and up to date.
The CC is configurable and extensible. Indeed it is made up of three
modules that can be independently installed, configured, deactivated
(compatibility mode).
1) The access restriction module uses views, triggers, rules and
functions. It works actually on the Chado feature table but it can be
extended to other tables being inspired by the
“install_access_restriction.tmpl” PgSQL script.
2) The annotation Inspector module calls checking PgSQL procedures
listed in the table “annotation_inspector_procedures” of the modified
Chado database. Each record of the table can be modified to enable or
disable a function (e.g. “check_te_structure”). New functions could be
added being inspired by already existing functions (e.g.
“check_gene_structure”) and knowing how the annotation history module
works (read the CC technical documentation here enclosed).
3) The annotation history module is based on the Chado Audit module.
History module is installed using the current Chado schema of the CAS
and not a static given schema. This made it independent of Chado
versions. Configuration of the history detail is made as for the
“gbrowse_details” through the modification of the PERL script
“gbrowse_history”.
The CC uses the same open source license for all users than GBrowse and
Chado, i.e. Artistic Licence 2.
The CC is obviously interoperable with other GMOD components as it is
middleware between a Chado database and client Software (e.g. GBrowse,
Artemis, Apollo).
For the access restriction module working with GBrowse 1.70, a patch
needs to be installed. For GBrowse 2.40, an authentication module is now
available but has not been extensively tested yet. For Artemis 13.2.8 a
patch needs to be installed. For Apollo 1.11.4, a new data adapter has
been added. Concerning administrative tasks, a compatibility mode was
developed. For instance, two problems occurred when loading data in the
Chado database with the “gmod_bulk_load” utility: (i) the PostgreSQL
command "COPY ... FROM ..." does not work with views and the feature
view is used by the access restriction module and (ii) the triggers of
the annotation Inspector should not be called during data loading. Two
solutions were set up: (i) if you do not need to simultaneously do data
loading and manual curation, you switch on the compatibility mode
(“cc_compatibility.pl” restore the “feature” table instead of the
“feature” view of access restriction module and disable the triggers of
the annotation Inspector module) or (ii) if you need to simultaneously
do data loading and manual curation, you take advantage of the modified
version gmod_bulk_load that works on “feature_data” table instead of
feature and you disable the triggers of the Inspector (“SELECT
set_ai_compatibility(TRUE);”). In both cases, the compatibility mode is
switched off after data loading.
All GMOD components working with Chado were not tested, but in
principle, the CC architecture implied that it could work with all
components with only slowing down in the worst case. To avoid slowing
down, the GMOD component just has to call the optimization function
“init_access” at the beginning of each PostgreSQL session. However, the
CC could not be compatible with software intrinsically linked to the
Chado schema and which need the “feature” table (i.e. software that use
advance functions based on PostgreSQL system tables to obtain
information on the Chado table typically “SELECT * FROM pg_class WHERE
relname = 'feature' AND relkind = 'r';”).
The developers of the Chado Controller (Valentin Guignon, Gaëtan Droc,
Mathieu Rouard and Stéphanie Bocs) are willing to make a good faith
commitment to support the Chado Controller for at least 2 years after
joining GMOD (setting up email lists, responding to user questions,
writing and maintaining documentation, and adding new features and bug
fixes to the code).
We are also open to new developers from the GMOD community contributing
code and other support to the new component.
We have created the gnpannot-users at cirad.fr mailing list that we could
publicly archived if the CC become a part of the GMOD suite.
We have created a public SVN repository
http://gmod:@svn-southgreen.cirad.fr/svn/chadocontroller/branches/installer_package/
(login gmod no password) and documentation are available the GNPAnnot
Web site http://www.gnpannot.org/content/chado-controller-doc
We can also migrate our code on the GMOD repository if needed.
Please could you evaluate the Chado Controller? If you decide that it is
suitable to become officially part of GMOD and you are interested in
integrating Chado schema modifications for the CC you can ask
suggestions directly to Valentin Guignon.
Guignon, V., G. Droc, et al. (Revised in 2011, BIOINF-2011-1444). "Chado
Controller: advanced annotation management with a community annotation
system." Bioinformatics.
Happy new year.
--
UMR AGAP - Cirad TA A-96 / 03 (Bt. 3, Bur. 12)
Av. Agropolis 34398 Montpellier Cedex 5
Phone +33 4 67 61 56 31 Fax +33 4 67 61 56 05 Cell phone +33 6 84 97 85 30
skype: sibocs
http://southgreen.cirad.fr/
-------------- next part --------------
A non-text attachment was scrubbed...
Name: chado_controller_technical_documentation_v1.0.0.pdf
Type: application/acroread
Size: 73823 bytes
Desc: not available
URL: <http://brie4.cshl.edu/pipermail/gmod-help/attachments/20120104/34e75051/attachment.bin>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Chado_Controller_publication_20120301_28_SSB_clean.pdf
Type: application/acroread
Size: 815909 bytes
Desc: not available
URL: <http://brie4.cshl.edu/pipermail/gmod-help/attachments/20120104/34e75051/attachment-0001.bin>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: supplementary_file_20120103_13_VG_SSB_clean.pdf
Type: application/acroread
Size: 234569 bytes
Desc: not available
URL: <http://brie4.cshl.edu/pipermail/gmod-help/attachments/20120104/34e75051/attachment-0002.bin>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: stephanie_sidibe-bocs.vcf
Type: text/x-vcard
Size: 514 bytes
Desc: not available
URL: <http://brie4.cshl.edu/pipermail/gmod-help/attachments/20120104/34e75051/attachment.vcf>
More information about the Gmod-help
mailing list