Skip to contents

Dataset Discovery

Find and explore available datasets

list_datasets()
List Available Datasets
dataset_info()
Get Dataset Information
installed_datasets()
List Installed Datasets

Discovery Helpers

Filter datasets by their contents

discovery_helpers
Dataset Discovery Helpers
has_predictor()
Check if Predictors Contains Value
has_all_predictors()
Check if Predictors Contains All Values
has_format()
Check if Formats Contains Value
is_small()
Check if Dataset is Small
is_large()
Check if Dataset is Large
size_between()
Check if Size is Within Range
has_min_structures()
Check if Structure Count is At Least N
has_max_structures()
Check if Structure Count is At Most N
title_contains()
Check if Title Contains Pattern
description_contains()
Check if Description Contains Pattern
is_recent()
Check if Dataset is Recent

Dataset Installation

Download and manage datasets

install_dataset()
Install a Dataset
remove_dataset()
Remove a Dataset
is_installed()
Check if Dataset is Installed
get_dataset_path()
Get Dataset Path

Data Loading

Load metadata and predictions

load_dataset()
Load Combined Metadata and Predictions
load_metadata()
Load Dataset Metadata
load_predictions()
Load Prediction Scores
load_datapackage()
Load Dataset Manifest
query_structures()
Query Structures

Structure Access

Extract structure files and PAE matrices

get_structure()
Get Structure File(s)
get_pae()
Get PAE Matrix
get_plddt()
Get pLDDT Scores
clear_extracted()
Clear Extraction Cache

Cross-Dataset Operations

Search and compare across datasets

search_structures()
Search Structures
find_protein()
Find Structures by Protein ID
find_best()
Find Best Structure
list_columns()
List Columns Across Datasets
compare_datasets()
Compare Datasets

Validation

Validate TSP packages

validate_tsp()
Validate a TSP Dataset
print(<tsp_validation>)
Print method for tsp_validation

Cache Management

Manage local cache

get_cache_dir()
Get Cache Directory
cache_size()
Cache Size
clear_cache()
Clear Cache