en agent skills

apm::packages

License: MIT✕Language: English✕Verified Clear

@orchestra-research/weights-and-biases

Track ML experiments with automatic logging, visualize training in real-time, optimize hyperparameters with sweeps, and manage model registry with W&B - collaborative MLOps platform

★ 5,030MIT

Orchestra-Research/development·3,272 tokens

@orchestra-research/sentence-transformers

skill

Framework for state-of-the-art sentence, text, and image embeddings. Provides 5000+ pre-trained models for semantic similarity, clustering, and retrieval. Supports multilingual, domain-specific, and multimodal models. Use for generating embeddings for RAG, semantic search, or similarity tasks. Best for production embedding generation.

★ 5,030MIT

Orchestra-Research/development·1,574 tokens

@orchestra-research/nemo-guardrails

skill

NVIDIA's runtime safety framework for LLM applications. Features jailbreak detection, input/output validation, fact-checking, hallucination detection, PII filtering, toxicity detection. Uses Colang 2.0 DSL for programmable rails. Production-ready, runs on T4 GPU.

★ 5,030MIT

Orchestra-Research/development·1,898 tokens

@orchestra-research/grpo-rl-training

skill

Expert guidance for GRPO/RL fine-tuning with TRL for reasoning and task-specific model training

★ 5,030MIT

Orchestra-Research/development·4,284 tokens

@orchestra-research/unsloth

skill

Expert guidance for fast fine-tuning with Unsloth - 2-5x faster training, 50-80% less memory, LoRA/QLoRA optimization

★ 5,030MIT

Orchestra-Research/development·492 tokens

@orchestra-research/long-context

skill

Extend context windows of transformer models using RoPE, YaRN, ALiBi, and position interpolation techniques. Use when processing long documents (32k-128k+ tokens), extending pre-trained models beyond original context limits, or implementing efficient positional encodings. Covers rotary embeddings, attention biases, interpolation methods, and extrapolation strategies for LLMs.

★ 5,030MIT

Orchestra-Research/development·4,249 tokens

@orchestra-research/mamba-architecture

skill

State-space model with O(n) complexity vs Transformers' O(n²). 5× faster inference, million-token sequences, no KV cache. Selective SSM with hardware-aware design. Mamba-1 (d_state=16) and Mamba-2 (d_state=128, multi-head). Models 130M-2.8B on HuggingFace.

★ 5,030MIT

Orchestra-Research/development·2,097 tokens

@orchestra-research/audiocraft-audio-generation

skill

PyTorch library for audio generation including text-to-music (MusicGen) and text-to-sound (AudioGen). Use when you need to generate music from text descriptions, create sound effects, or perform melody-conditioned music generation.

★ 5,030MIT

Orchestra-Research/development·3,758 tokens

@orchestra-research/modal-serverless-gpu

skill

Serverless GPU cloud platform for running ML workloads. Use when you need on-demand GPU access without infrastructure management, deploying ML models as APIs, or running batch jobs with automatic scaling.

★ 5,030MIT

Orchestra-Research/development·2,149 tokens

@orchestra-research/model-pruning

skill

Reduce LLM size and accelerate inference using pruning techniques like Wanda and SparseGPT. Use when compressing models without retraining, achieving 50% sparsity with minimal accuracy loss, or enabling faster inference on hardware accelerators. Covers unstructured pruning, structured pruning, N:M sparsity, magnitude pruning, and one-shot methods.

★ 5,030MIT

Orchestra-Research/development·3,762 tokens

@orchestra-research/lambda-labs-gpu-cloud

skill

Reserved and on-demand GPU cloud instances for ML training and inference. Use when you need dedicated GPU instances with simple SSH access, persistent filesystems, or high-performance multi-node clusters for large-scale training.

★ 5,030MIT

Orchestra-Research/development·3,273 tokens

@orchestra-research/nemo-curator

skill

GPU-accelerated data curation for LLM training. Supports text/image/video/audio. Features fuzzy deduplication (16× faster), quality filtering (30+ heuristics), semantic deduplication, PII redaction, NSFW detection. Scales across GPUs with RAPIDS. Use for preparing high-quality training datasets, cleaning web data, or deduplicating large corpora.

★ 5,030MIT

Orchestra-Research/development·2,513 tokens

@orchestra-research/fine-tuning-with-trl

skill

Fine-tune LLMs using reinforcement learning with TRL - SFT for instruction tuning, DPO for preference alignment, PPO/GRPO for reward optimization, and reward model training. Use when need RLHF, align model with preferences, or train from human feedback. Works with HuggingFace Transformers.

★ 5,030MIT

Orchestra-Research/development·3,079 tokens

@orchestra-research/mlflow

skill

Track ML experiments, manage model registry with versioning, deploy models to production, and reproduce experiments with MLflow - framework-agnostic ML lifecycle platform

★ 5,030MIT

Orchestra-Research/development·3,954 tokens

@orchestra-research/simpo-training

skill

Simple Preference Optimization for LLM alignment. Reference-free alternative to DPO with better performance (+6.4 points on AlpacaEval 2.0). No reference model needed, more efficient than DPO. Use for preference alignment when want simpler, faster training than DPO/PPO.

★ 5,030MIT

Orchestra-Research/development·1,661 tokens

@orchestra-research/whisper

skill

OpenAI's general-purpose speech recognition model. Supports 99 languages, transcription, translation to English, and language identification. Six model sizes from tiny (39M params) to large (1550M params). Use for speech-to-text, podcast transcription, or multilingual audio processing. Best for robust, multilingual ASR.

★ 5,030MIT

Orchestra-Research/development·2,028 tokens

@orchestra-research/axolotl

skill

Expert guidance for fine-tuning LLMs with Axolotl - YAML configs, 100+ models, LoRA/QLoRA, DPO/KTO/ORPO/GRPO, multimodal support

★ 5,030MIT

Orchestra-Research/development·1,144 tokens

@orchestra-research/gguf-quantization

skill

GGUF format and llama.cpp quantization for efficient CPU/GPU inference. Use when deploying models on consumer hardware, Apple Silicon, or when needing flexible quantization from 2-8 bit without GPU requirements.

★ 5,030MIT

Orchestra-Research/development·3,147 tokens

@orchestra-research/transformer-lens-interpretability

skill

Provides guidance for mechanistic interpretability research using TransformerLens to inspect and manipulate transformer internals via HookPoints and activation caching. Use when reverse-engineering model algorithms, studying attention patterns, or performing activation patching experiments.

★ 5,030MIT

Orchestra-Research/development·3,056 tokens

@orchestra-research/gptq

skill

Post-training 4-bit quantization for LLMs with minimal accuracy loss. Use for deploying large models (70B, 405B) on consumer GPUs, when you need 4× memory reduction with <2% perplexity degradation, or for faster inference (3-4× speedup) vs FP16. Integrates with transformers and PEFT for QLoRA fine-tuning.

★ 5,030MIT

Orchestra-Research/development·3,462 tokens

Prev 1...36 37 38...144 Next

Page 37 of 144