apm::packages

@orchestra-research/pyvene-interventions

Provides guidance for performing causal interventions on PyTorch models using pyvene's declarative intervention framework. Use when conducting causal tracing, activation patching, interchange intervention training, or testing causal hypotheses about model behavior.

★ 5,030 · MIT

Orchestra-Research/development · 3,358 tokens

@orchestra-research/evaluating-code-models

skill

Evaluates code generation models across HumanEval, MBPP, MultiPL-E, and 15+ benchmarks with pass@k metrics. Use when benchmarking code models, comparing coding abilities, testing multi-language support, or measuring code generation quality. Industry standard from BigCode Project used by HuggingFace leaderboards.

★ 5,030 · MIT

Orchestra-Research/development · 3,244 tokens

@orchestra-research/distributed-llm-pretraining-torchtitan

skill

Provides PyTorch-native distributed LLM pretraining using torchtitan with 4D parallelism (FSDP2, TP, PP, CP). Use when pretraining Llama 3.1, DeepSeek V3, or custom models at scale from 8 to 512+ GPUs with Float8, torch.compile, and distributed checkpointing.

★ 5,030 · MIT

Orchestra-Research/development · 2,634 tokens

@orchestra-research/ray-train

skill

Distributed training orchestration across clusters. Scales PyTorch/TensorFlow/HuggingFace from laptop to 1000s of nodes. Built-in hyperparameter tuning with Ray Tune, fault tolerance, elastic scaling. Use when training massive models across multiple machines or running distributed hyperparameter sweeps.

★ 5,030 · MIT

Orchestra-Research/development · 2,537 tokens

@orchestra-research/moe-training

skill

Train Mixture of Experts (MoE) models using DeepSpeed or HuggingFace. Use when training large-scale models with limited compute (5× cost reduction vs dense models), implementing sparse architectures like Mixtral 8x7B or DeepSeek-V3, or scaling model capacity without proportional compute increase. Covers MoE architectures, routing mechanisms, load balancing, expert parallelism, and inference optimization.

★ 5,030 · MIT

Orchestra-Research/development · 4,130 tokens

@orchestra-research/pytorch-lightning

skill

High-level PyTorch framework with Trainer class, automatic distributed training (DDP/FSDP/DeepSpeed), callbacks system, and minimal boilerplate. Scales from laptop to supercomputer with same code. Use when you want clean training loops with built-in best practices.

★ 5,030 · MIT

Orchestra-Research/development · 2,254 tokens

@orchestra-research/speculative-decoding

skill

Accelerate LLM inference using speculative decoding, Medusa multiple heads, and lookahead decoding techniques. Use when optimizing inference speed (1.5-3.6× speedup), reducing latency for real-time applications, or deploying models with limited compute. Covers draft models, tree-based attention, Jacobi iteration, parallel token generation, and production deployment strategies.

★ 5,030 · MIT

Orchestra-Research/development · 3,753 tokens

@orchestra-research/evaluating-llms-harness

skill

Evaluates LLMs across 60+ academic benchmarks (MMLU, HumanEval, GSM8K, TruthfulQA, HellaSwag). Use when benchmarking model quality, comparing models, reporting academic results, or tracking training progress. Industry standard used by EleutherAI, HuggingFace, and major labs. Supports HuggingFace, vLLM, APIs.

★ 5,030 · MIT

Orchestra-Research/development · 3,474 tokens

@orchestra-research/segment-anything-model

skill

Foundation model for image segmentation with zero-shot transfer. Use when you need to segment any object in images using points, boxes, or masks as prompts, or automatically generate all object masks in an image.

★ 5,030 · MIT

Orchestra-Research/development · 3,395 tokens

@orchestra-research/optimizing-attention-flash

skill

Optimizes transformer attention with Flash Attention for 2-4× speedup and 10-20× memory reduction. Use when training or running transformers with long sequences (>512 tokens), encountering GPU memory issues with attention, or needing faster inference. Supports PyTorch native SDPA, flash-attn library, H100 FP8, and sliding window attention.

★ 5,030 · MIT

Orchestra-Research/development · 2,901 tokens

@orchestra-research/guidance

skill

Control LLM output with regex and grammars, guarantee valid JSON/XML/code generation, enforce structured formats, and build multi-step workflows with Guidance, Microsoft Research's constrained generation framework.

★ 5,030 · MIT

Orchestra-Research/development · 3,983 tokens

@orchestra-research/instructor

skill

Extract structured data from LLM responses with Pydantic validation, retry failed extractions automatically, parse complex JSON with type safety, and stream partial results with Instructor, a battle-tested structured output library.

★ 5,030 · MIT

Orchestra-Research/development · 4,250 tokens

@orchestra-research/dspy

skill

Build complex AI systems with declarative programming, optimize prompts automatically, and create modular RAG systems and agents with DSPy, Stanford NLP's framework for systematic LM programming.

★ 5,030 · MIT

Orchestra-Research/development · 3,735 tokens

@orchestra-research/outlines

skill

Guarantee valid JSON/XML/code structure during generation, use Pydantic models for type-safe outputs, support local models (Transformers, vLLM), and maximize inference speed with Outlines, dottxt.ai's structured generation library.

★ 5,030 · MIT

Orchestra-Research/development · 4,034 tokens

@orchestra-research/slime-rl-training

skill

Provides guidance for LLM post-training with RL using slime, a Megatron+SGLang framework. Use when training GLM models, implementing custom data generation workflows, or needing tight Megatron-LM integration for RL scaling.

★ 5,030 · MIT

Orchestra-Research/development · 2,969 tokens

@orchestra-research/implementing-llms-litgpt

skill

Implements and trains LLMs using Lightning AI's LitGPT with 20+ pretrained architectures (Llama, Gemma, Phi, Qwen, Mistral). Use when you need clean model implementations, educational understanding of architectures, or production fine-tuning with LoRA/QLoRA. Single-file implementations, no abstraction layers.

★ 5,030 · MIT

Orchestra-Research/development · 3,217 tokens

@vercel/web-design-guidelines

✓ skill

Review UI code for Web Interface Guidelines compliance. Use when asked to "review my UI", "check accessibility", "audit design", "review UX", or "check my site against best practices".

★ 4,991 · MIT

vercel/design · 308 tokens · design

@microsoft/policy-check

✓ skill

This skill should be used when the user asks to "run policy check", "check policy", "policy-check", or needs to validate package compliance. Provides guidance on running policy checks for specific packages or the entire repository.

★ 4,918 · MIT

microsoft/devops · 153 tokens · devops

@microsoft/fluid-release

✓ skill

Handles the Fluid Framework client release group: minor releases, patch releases, and post-release type test updates. Covers release prep, branching, version bumps, changelogs, release notes, and type test baselines. In autonomous mode, auto-detects state from the schedule and repo, attempts to execute, and falls back to a GitHub issue on failure. Triggers on "release", "do the release", "release status", version bump, release notes, changelog, release branch, or release engineering.

★ 4,918 · MIT

microsoft/git-workflow · 3,375 tokens · git · testing

@microsoft/trigger-pipelines-for-copilot-pr

✓ skill

Trigger ADO pipelines for a Copilot-created PR by posting /azp run comments. Use when the user asks to trigger CI pipelines for a specific PR.

★ 4,918 · MIT

microsoft/development · 433 tokens · git · testing · api-design
