Publication

OpenDataHub: an open dataset management system

datacite.subject.fos: Engenharia e Tecnologia::Engenharia Eletrotécnica, Eletrónica e Informática [pt_PT]
dc.contributor.advisor: Nunes, Duarte Nuno Jardim
dc.contributor.advisor: Quintal, Filipe
dc.contributor.advisor: Pereira, Lucas
dc.contributor.author: Gonçalves, Orêncio Rodolfo Abreu
dc.date.accessioned: 2017-01-04T12:11:56Z
dc.date.available: 2017-02-04T01:30:09Z
dc.date.issued: 2016-06
dc.description.abstract: This thesis presents a cloud-based software platform for sharing publicly available scientific datasets. The proposed platform leverages the potential of NoSQL databases and asynchronous I/O technologies, such as Node.js, in order to achieve a high-performance, flexible solution. The solution serves two main groups of users: dataset providers, the researchers responsible for sharing and maintaining datasets, and dataset users, who wish to access the public data. The former are given tools to easily publish and maintain large volumes of data, whereas the latter are given tools to preview the original data and create subsets of it through filter and aggregation operations. The choice of NoSQL over a more traditional RDBMS emerged from an extended benchmark between a relational database (MySQL) and a NoSQL database (MongoDB), which is also presented in this thesis. The results confirm the theoretical expectation that NoSQL databases are more suitable for the kind of data that the system's users will be handling, i.e., non-homogeneous data structures that can grow very quickly. It is envisioned that a platform like this can lead the way to a new era of scientific data sharing in which researchers can easily share and access all kinds of datasets and, in more advanced scenarios, be presented with recommended datasets and existing research results built on those recommendations. [pt_PT]
dc.identifier.tid: 201329344
dc.identifier.uri: http://hdl.handle.net/10400.13/1322
dc.language.iso: eng [pt_PT]
dc.subject: Ciência e tecnologia informáticas [pt_PT]
dc.subject: Engenharia [pt_PT]
dc.subject: Plataformas [pt_PT]
dc.subject: Computação [pt_PT]
dc.subject: Dados [pt_PT]
dc.subject: Dataset [pt_PT]
dc.subject: Sistema de gerenciamento [pt_PT]
dc.subject: Conjunto de dados [pt_PT]
dc.subject: OpenDataHub [pt_PT]
dc.subject: Linguagens informáticas [pt_PT]
dc.subject: LCAB [pt_PT]
dc.subject: MySQL [pt_PT]
dc.subject: MongoDB [pt_PT]
dc.subject: Software [pt_PT]
dc.subject: Benchmark [pt_PT]
dc.subject: Informatics Engineering [pt_PT]
dc.subject: Computer Science [pt_PT]
dc.subject: Faculdade de Ciências Exatas e da Engenharia [pt_PT]
dc.title: OpenDataHub: an open dataset management system [pt_PT]
dc.type: master thesis
dspace.entity.type: Publication
rcaap.rights: openAccess [pt_PT]
rcaap.type: masterThesis [pt_PT]
thesis.degree.name: Master in Informatics Engineering [pt_PT]

Files

Original bundle
Name: MestradoOrêncioGonçalves.pdf
Size: 5.95 MB
Format: Adobe Portable Document Format

License bundle
Name: license.txt
Size: 1.71 KB
Format: Item-specific license agreed upon to submission