GEO (at NCBI) and ArrayExpress (at EBI) are the two public microarray data management repositories in the world, and currently there is no data exchange performed between them. The ArrayExpress project from the very start was standards-oriented with a standard object model and MAGE-ML format data loading. The MIAMExpress submission tool was built to enable translation between web pages filled by submitters and exports MAGE-ML. GEO defined their own data formatting requirements earlier. They accept data in SOFT (Simple Omnibus Format in Text) format which defines a set of linked spreadsheets and where data submitters can choose which fields to annotate. The aim of this project is to analyze data in GEO and to design a set of rules how data can be mapped to ArrayExpress infrastructure, in particular the data warehouse and to import some GEO data into ArrayExpress and to automate this process by providing good curators' tools. |