✓ All Systems Operational |
SERVICES / SERVERS
HALLEY | ||
Our Halley service is a remote server for KorcsmarosLab use only. | ||
Uptime: | ||
Actual status of the service: | Operational |
SLK3WEBSERVER | ||
Our SLK3 Webserver is a remote server for KorcsmarosLab use only. Service(s), which is running from this server: SLK3 |
||
Uptime: | ||
Actual status of the service: | Operational |
WEBHOSTING | ||
Our webhosting service is provided by 3 interconnected virtual machines located at the Earlham Institute in Norwich, UK. Service(s), which is running from this server cluster: ARN | NRF2OME |
||
Uptime: | ||
Actual status of the service: | Operational |
SHERLOCK CLUSTER | ||
Our Sherlock cluster is provided by 3 interconnected virtual machines located at the Earlham Institute in Norwich, UK. Service(s), which is running from this server cluster: Sherlock |
||
Uptime: | ||
Actual status of the service: | Operational |
SHERLOCK | ||
Sherlock is an open source data platform, developed in the Korcsmaros Group (Earlham Institute, Norwich, UK) to store, analyze and integrate bioinformatics data. With Sherlock: 1) you can store all datasets in a redundant, organized cloud storage, 2) convert all datasets to common, optimized file formats, 3) execute analytical queries on top of data files, 4) share datasets among different teams / projects, 5) generate operational datasets for certain services or collaborators. Sherlock is using standard big data technologies to store the biological data and to analyze it. The main concept can be seen in the following figure (don’t be confused, every word on it will be explained later). Sherlock follows industrial best practices in its architecture. Usually the main idea behind these scalable batch processing architectures is the separation of data storage and analytics, as you can see below. This allows you to scale the analytical power and the storage size independently from each other and even dynamically, if you are deploying Sherlock in the cloud. Sherlock is freely available on github. | ||
Uptime: | ||
Actual status of the service: | Operational |
DR. WATSON | ||
Our Dr. Watson service is a Data Lake, where we are storing our different datafiles for our Sherlock Big Data platform. This data lake is operated by Digital Ocean (S3). In our Data Lake, we have 4 different zones/layers, to store data: 1) the raw zone, where we are storing the raw data, what was downloaded from the different freely available databases. 2) the landing zone, where we store our data in a compact and common json format. 3) the master zone, where we store our data in a common and really good quality formatted file format, called ORC (Optimized Row Columnar). 4) the project zone, where we store specific data for our different projects. Right now, in our Data Lake we have different datatypes: interaction data, expression data, sequence data, annotation data. | ||
Actual status of the service: | Operational |
Databases/datasources in our Data Lake |
Expression data |
Bgee database | actual version: 05/04/2021 |
Interaction data |
BioPlex database | actual version: 27/04/2020 |
Dorothea database | actual version: 27/01/2022 |
HINT database | actual version: 08/09/2021 |
HuRi database | actual version: 16/11/2020 |
InBioMap database | actual version: 24/05/2019 |
IntAct database | actual version: 27/01/2022 |
IRefIndex database | actual version: 08/09/2021 |
Mentha database | actual version: 24/01/2022 |
Omnipath database | actual version: 27/01/2022 |
String database | actual version: 09/09/2021 |
Other datatypes |
DBSnp | actual version: 26/06/2019 |
Gene Ontology | actual version: 27/01/2021 |
GO Annotations | actual version: 27/01/2021 |
Human Genome | actual version: GRCh38.p13 |
Uberon Gene Ontology | actual version: 13/04/2021 |
UniProt ID Mapping Table | actual version: 04/2021 |
WEBSERVICES
SLK3 | ||
SignaLink 3 is an integrated resource to analyze signaling pathway cross-talks, transcription factors, miRNAs and regulatory enzymes. Main features: 1) A signaling network resource with known and predicted information for human and model organisms. 2) Manually curated dataset of major signaling pathways - including curated data from resources such as ACSN, InnateDB, Reactome and Signor. 3) Extends pathways with integrated regulatory resources to contain pathway-specific transcription factors, miRNA, scaffolds and post translational modifying enzymes. 4) Proteins are classified by pathway position (core/non-core) and function (ligand, receptor, mediator, etc.). 5) Signaling interactions are directed and labeled with PubMed IDs of the publications of experimental evidence. 6) A multi-layered network structure allows the selection of user-specific details. 7) Allows filtering based on tissue or sub-cellular localization. 8) Supporting downloads in csv, biopax (level 3), psimi tab, sbml or cytoscape formats. | ||
How to cite SLK3: | ||
SignaLink 2.0 - A signaling pathway resource with multi-layered regulatory networks Fazekas D*, Koltai M*, Türei D*, Módos D, Pálfy M, Dúl Z, Zsákai L, Szalay-Bekő M, Lenti K, Farkas I J, Vellai T, Csermely P, Korcsmáros T (* equal contributions) | BMC Systems Biology 2013, 7:7. | ||
Uniformly curated signaling pathways reveal tissue-specific cross-talks and support drug target discovery. Korcsmáros T *, Farkas I J *, Szalay M S, Rovó P, Fazekas D, Spiro Z, Böde C, Lenti K, Vellai T, Csermely P (* equal contributions) | Bioinformatics 26:2042-2050 (2010). | ||
Actual status of the service: | Operational |