Learning Outcomes
- Utilize Data sources to create Datasets.
- Identify different types of Data Sources supported by Via Foundry.
- Learn key features of Datasets.
Datasets
Datasets serve as the starting point for managing and processing your data within the platform.
Types of Data Sources
Via Foundry supports a variety of data sources to cater to diverse user needs:
| Data Source Type | Description |
|---|---|
| Cloud Storage Providers | This includes popular cloud platforms such as AWS S3, Google Cloud Storage, and Azure Blob Storage. |
| Local Storage | Access files stored directly on your local High performance computing (HPC) cluster. |
| Public Repositories | Utilize publicly available datasets hosted online, including sources such as ENCODE or GEO databases. |
| URL | Access datasets available through URLs and FTPs. |
Features of Datasets in Via Foundry
Security
Establishing a secure connections to data through data source is a priority within Via Foundry. Users are required to provide appropriate credentials to access their data sources. These credentials are encrypted and securely stored to prevent unauthorized access.
Reusable Connections
Once a data source is connected, you can save the connection as bookmark for future use, eliminating the need to re-enter credentials or connection details every time.