skill agent skills

apm::packages

@orchestra-research/knowledge-distillation

Compress large language models using knowledge distillation from teacher to student models. Use when deploying smaller models with retained performance, transferring GPT-4 capabilities to open-source models, or reducing inference costs. Covers temperature scaling, soft targets, reverse KLD, logit distillation, and MiniLLM training strategies.

★ 5,030MIT

Orchestra-Research/development·3,411 tokens

@orchestra-research/hqq-quantization

skill

Half-Quadratic Quantization for LLMs without calibration data. Use when quantizing models to 4/3/2-bit precision without needing calibration datasets, for fast quantization workflows, or when deploying with vLLM or HuggingFace Transformers.

★ 5,030MIT

Orchestra-Research/development·3,222 tokens

@orchestra-research/ml-paper-writing

skill

Write publication-ready ML/AI/Systems papers for NeurIPS, ICML, ICLR, ACL, AAAI, COLM, OSDI, NSDI, ASPLOS, SOSP. Use when drafting papers from research repos, structuring arguments, verifying citations, or preparing camera-ready submissions. Includes LaTeX templates, reviewer guidelines, and citation verification workflows.

★ 5,030MIT

Orchestra-Research/development·9,418 tokens

@orchestra-research/qdrant-vector-search

skill

High-performance vector similarity search engine for RAG and semantic search. Use when building production RAG systems requiring fast nearest neighbor search, hybrid search with filtering, or scalable vector storage with Rust-powered performance.

★ 5,030MIT

Orchestra-Research/development·3,295 tokens

@orchestra-research/llamaindex

skill

Data framework for building LLM applications with RAG. Specializes in document ingestion (300+ connectors), indexing, and querying. Features vector indices, query engines, agents, and multi-modal support. Use for document Q&A, chatbots, knowledge retrieval, or building RAG pipelines. Best for data-centric LLM applications.

★ 5,030MIT

Orchestra-Research/development·3,506 tokens

@orchestra-research/awq-quantization

skill

Activation-aware weight quantization for 4-bit LLM compression with 3x speedup and minimal accuracy loss. Use when deploying large models (7B-70B) on limited GPU memory, when you need faster inference than GPTQ with better accuracy preservation, or for instruction-tuned and multimodal models. MLSys 2024 Best Paper Award winner.

★ 5,030MIT

Orchestra-Research/development·2,482 tokens

@orchestra-research/evaluating-code-models

skill

Evaluates code generation models across HumanEval, MBPP, MultiPL-E, and 15+ benchmarks with pass@k metrics. Use when benchmarking code models, comparing coding abilities, testing multi-language support, or measuring code generation quality. Industry standard from BigCode Project used by HuggingFace leaderboards.

★ 5,030MIT

Orchestra-Research/development·3,244 tokens

@orchestra-research/pytorch-lightning

skill

High-level PyTorch framework with Trainer class, automatic distributed training (DDP/FSDP/DeepSpeed), callbacks system, and minimal boilerplate. Scales from laptop to supercomputer with same code. Use when you want clean training loops with built-in best practices.

★ 5,030MIT

Orchestra-Research/development·2,254 tokens

@orchestra-research/speculative-decoding

skill

Accelerate LLM inference using speculative decoding, Medusa multiple heads, and lookahead decoding techniques. Use when optimizing inference speed (1.5-3.6× speedup), reducing latency for real-time applications, or deploying models with limited compute. Covers draft models, tree-based attention, Jacobi iteration, parallel token generation, and production deployment strategies.

★ 5,030MIT

Orchestra-Research/development·3,753 tokens

@orchestra-research/sparse-autoencoder-training

skill

Provides guidance for training and analyzing Sparse Autoencoders (SAEs) using SAELens to decompose neural network activations into interpretable features. Use when discovering interpretable features, analyzing superposition, or studying monosemantic representations in language models.

★ 5,030MIT

Orchestra-Research/development·3,272 tokens

@orchestra-research/clip

skill

OpenAI's model connecting vision and language. Enables zero-shot image classification, image-text matching, and cross-modal retrieval. Trained on 400M image-text pairs. Use for image search, content moderation, or vision-language tasks without fine-tuning. Best for general-purpose image understanding.

★ 5,030MIT

Orchestra-Research/development·1,752 tokens

@orchestra-research/tensorboard

skill

Visualize training metrics, debug models with histograms, compare experiments, visualize model graphs, and profile performance with TensorBoard - Google's ML visualization toolkit

★ 5,030MIT

Orchestra-Research/development·3,808 tokens

@orchestra-research/peft-fine-tuning

skill

Parameter-efficient fine-tuning for LLMs using LoRA, QLoRA, and 25+ methods. Use when fine-tuning large models (7B-70B) with limited GPU memory, when you need to train <1% of parameters with minimal accuracy loss, or for multi-adapter serving. HuggingFace's official library integrated with transformers ecosystem.

★ 5,030MIT

Orchestra-Research/development·3,463 tokens

@vercel/web-design-guidelines

✓skill

Review UI code for Web Interface Guidelines compliance. Use when asked to "review my UI", "check accessibility", "audit design", "review UX", or "check my site against best practices".

★ 4,991MIT

vercel/design·308 tokens·design

@microsoft/fluid-release

✓skill

Fluid Framework client release group — minor releases, patch releases, and post-release type test updates. Covers release prep, branching, version bumps, changelogs, release notes, and type test baselines. In autonomous mode, auto-detects state from the schedule and repo, attempts to execute, and falls back to a GitHub issue on failure. Triggers on "release", "do the release", "release status", version bump, release notes, changelog, release branch, or release engineering.

★ 4,918MIT

microsoft/git-workflow·3,375 tokens·gittesting

@microsoft/trigger-pipelines-for-copilot-pr

✓skill

Trigger ADO pipelines for a Copilot-created PR by posting /azp run comments. Use when the user asks to trigger CI pipelines for a specific PR.

★ 4,918MIT

microsoft/development·433 tokens·gittestingapi-design

@microsoft/policy-check

✓skill

This skill should be used when the user asks to "run policy check", "check policy", "policy-check", or needs to validate package compliance. Provides guidance on running policy checks for specific packages or the entire repository.

★ 4,918MIT

microsoft/devops·153 tokens·devops

@microsoft/worktree-manager

✓skill

Create and manage Git worktrees for parallel development workflows. Use when multiple self-contained issues should NOT be fixed in a single branch, when human-Copilot iteration requires isolated environments with separate chat history and commits, or when parallel work items need independent build/test results. Triggers on requests involving branch isolation, work item separation, parallel development, or avoiding messy branch switching.

★ 4,457MIT

microsoft/development·2,048 tokens·gittesting

@microsoft/issue-triage-report

✓skill

Generate comprehensive GitHub Feature Area Status reports for the Windows App SDK repository. Use when asked to create triage reports, identify high-priority issues, analyze feature area health, find issues needing attention, or generate status dashboards. Triggers on requests involving issue triage, area status, priority analysis, bug tracking reports, or engineering team focus areas.

★ 4,457MIT

microsoft/development·2,368 tokens·git

@microsoft/triage-meeting-prep

✓skill

Prepare weekly triage meeting summary for WinAppSDK Needs-Triage issues. Use when preparing for triage meetings, reviewing Needs-Triage issues, generating diff reports since last triage, summarizing new or updated issues, or creating action item recommendations. Triggers on requests involving triage preparation, Needs-Triage review, meeting summary, triage diff, or weekly issue analysis.

★ 4,457MIT

microsoft/development·2,397 tokens·git

Prev 1...39 40 41...148 Next

Page 40 of 148