Dave Bunten d33bs

d33bs / scratch
Last active September 4, 2024 20:04
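# Install pyenv, then a Python 3.11 interpreter
curl https://pyenv.run | bash
pyenv install 3.11
# Install Poetry for dependency management
pip install poetry
# Fetch the jump-umap-example branch of coSMicQC
git clone -b jump-umap-example https://github.com/d33bs/coSMicQC
cd coSMicQC
# Install the project dependencies and launch JupyterLab
poetry install
poetry run jupyter lab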
"""
Seeks to query and extract biological process data from OLS4
using web API, Pandas, and Parquet.
API reference:
https://www.ebi.ac.uk/ols4/swagger-ui/index.html
Lab or group: Greene Lab
"""
"""
Seeks the name and mechanism of action for drugs from the DrugBank dataset.
Data reference:
https://go.drugbank.com/releases/latest#full
Lab or group: JRaviLab
"""
from typing import Dict, Any
import pandas as pd
import numpy as np
import pyarrow as pa
import pyarrow.parquet as pq
import os
# Constants for the table dimensions and chunk size
n_rows = 10000
n_cols = 5000
chunk_size = 1000