better DataArchitectureDiagram.md

Kaimbacher 2023-04-25 16:34:32 +02:00
parent ddf38c0d4a
commit 3389233939
2 changed files with 211 additions and 44 deletions

@ -7,9 +7,9 @@
TETHYS collects and manages scanned geological maps, spatial data sets, and other types of research outputs.
The general guideline for publishing research data <a href="https://tethys.at/docs/HandbuchTethys.pdf#page=12" target="_blank">(manual, p. 11-2)</a> is providing information on how to prepare data sources for deposit (helpful for depositors and repository staff):
- the types of file formats that are acceptable,
- the types of file formats that are acceptable. All **allowed file fomats** must be enabled by the administrator in the database table **'mime_types'**. Also see <a href="https://gitea.geologie.ac.at/geolba/tethys.backend/wiki/Database#tethys-lookup-tables" target="_blank">(db lookup tables)</a>
- what kind of data can be included in the research data repository,
- the file upload limits,
- the file upload limits (512MB),
- what CC licenses can be used,
- the strategy for deleting datasets,
- how to handle the use of external research data,
@ -22,13 +22,13 @@ The **Metadata format guidelines** <a href="https://tethys.at/docs/HandbuchTethy
# 3. Storage Infrastructure
This is the physical or virtual infrastructure used to store the data and metadata in the repository. By using **PostgreSQL**, TETHYS is able to manage large volumes of metadata and provide fast and secure access to this information. The data files are stored on an **Ubuntu file server** with **ext4**partition.
This is the physical or virtual infrastructure used to store the data and metadata in the repository. By using **PostgreSQL**, TETHYS is able to manage large volumes of metadata and provide fast and secure access to this information. The data files are stored on an **Ubuntu 22.04 file server** with **ext4**partition.
# 4. Data processing and curation workflows
...
# 5. Discovery and access systems
# 5. Data Discovery
TETHYS supports data discovery in various ways
@ -73,6 +73,10 @@ TETHYS provides an Access Control List (ACL) system that is used to manage users
- link_account_roles: This table stores the relationship between users (Accounts) and roles. Each row in this table represents a user that has been assigned a specific role.
# 7. Preservation systems
# 7. System Integration
....
System Integration with other systems involves integrating the research repository with other systems, such as project management tools, data platforms and reporting tools.
By providing the Open Archives Initiative Protocol for Metadata Harvesting (**OAI-PMH**) any other data provider can harvest TETHYS metadata. An example would be the **Bielefeld Academic Search Engine BASE**: https://www.base-search.net/Search/Results?q=coll:fttethysrdr&refid=dctablede
Matomo is used to track statistics data for TETHYS. Matomo is an open-source web analytics platform that can be used to track user behavior on a website.

File diff suppressed because one or more lines are too long

Before

Width:  |  Height:  |  Size: 104 KiB

After

Width:  |  Height:  |  Size: 124 KiB