Question 1

Can Cursor or GitHub Copilot work in air-gapped environments?

Accepted Answer

No. Cursor, GitHub Copilot, and Windsurf all require internet connectivity to function. Their AI processing happens on cloud servers, which means code must traverse the public internet to reach the AI model. Cursor's Ghost Mode allows local model use but does not support the full feature set and was not designed for air-gapped deployment. For environments where internet access is prohibited, these tools are categorically excluded.

Question 2

What models can Fabric run in an air-gapped environment?

Accepted Answer

Fabric supports any model that can be served via a vLLM, Ollama, or OpenAI-compatible API endpoint on local infrastructure. Proven options include GLM-5, Qwen 3.5 (including the 397B MoE variant), Llama 3.3, Mistral Large, DeepSeek-V3, and CodeStral. Model selection depends on your available GPU hardware and the quality-performance trade-off your team is willing to accept.

Question 3

What hardware is required for on-premise AI model hosting?

Accepted Answer

Requirements vary by model size. A 70B parameter model requires approximately 140GB of GPU VRAM in FP16 (two A100 80GB GPUs or equivalent). Smaller models (7B-14B) can run on a single consumer GPU. Quantized models (FP8, GPTQ, AWQ) reduce VRAM requirements by 30-50%. For enterprise deployment, we recommend at least 4x A100 GPUs to serve a 70B+ model with acceptable latency for a team of 20-50 developers. For maximum performance with frontier-scale MoE models, the NVIDIA GB200 NVL72 provides 13.5TB of unified HBM3e memory and 1.4 exaflops of FP4 compute — enough to serve multiple 400B+ models simultaneously with room to spare.

Question 4

How does air-gapped AI coding compare to cloud-based AI coding in quality?

Accepted Answer

Honestly, local models are currently less capable than frontier cloud models like Claude Opus, GPT-4o, or Gemini 2.5 Pro. The gap is narrowing — open models like Qwen 3.5 and GLM-5 deliver strong performance on coding tasks — but a quality trade-off exists. Fabric lets you calibrate this dial: use cloud models when allowed, switch to local models when sovereignty requires it. The transition is seamless, and all context carries over.

Question 5

Does Fabric support ITAR and FedRAMP compliance?

Accepted Answer

Fabric's architecture supports deployment within ITAR-controlled and FedRAMP-authorized environments. In air-gapped mode, no data leaves the controlled environment — there are no cloud API calls, no telemetry, no external dependencies. The compliance posture is determined by your infrastructure configuration rather than vendor policy. Fabric's deployment is designed to operate within existing approved infrastructure boundaries.

Question 6

Can I start with cloud and migrate to air-gapped later?

Accepted Answer

Yes. This is a core design principle of Fabric. You can start with cloud-hosted models for maximum capability, then reconfigure to on-premise deployment when requirements change — regulatory shifts, new contract requirements, or organizational policy changes. All conversation history, context, project settings, and workflow configurations migrate with you. No work is lost in the transition.

Model	Parameters	Min. VRAM	Strengths
GLM-5	Multiple sizes	Varies	Strong multilingual coding, long context
Qwen 3.5 397B (MoE)	397B (17B active)	8x A100 80GB or GB200 NVL72	Near-frontier coding capability, 256K context
Qwen 3.5 32B	32B	2x A100 40GB	Excellent cost-performance ratio for coding
Llama 3.3 70B	70B	2x A100 80GB	Strong general-purpose coding, widely deployed
Mistral Large	123B	4x A100 80GB	Strong reasoning, enterprise-oriented
DeepSeek-V3	671B (37B active)	GB200 NVL72 or 8x A100 80GB	Excellent code generation, MoE efficiency
CodeStral	22B	1x A100 40GB	Purpose-built for code, fast inference

Framework	Requirement	How Fabric Addresses It
GDPR	Data residency within EU	On-premise deployment ensures code and AI interactions never leave your EU infrastructure. No data transfer to US or other jurisdictions.
PIPEDA	Canadian data sovereignty	Air-gapped deployment on Canadian infrastructure. Zero cross-border data flow. Architectural sovereignty, not just contractual.
ITAR	Technical data access controls	All AI processing occurs within ITAR-controlled environment. No technical data is transmitted externally. Self-hosted models process exclusively local data.
FedRAMP	Authorized infrastructure boundaries	Fabric deploys within FedRAMP-authorized infrastructure (AWS GovCloud, Azure Government). No external dependencies beyond the authorized boundary.
ISO 27001	Information security management	Air-gapped deployment satisfies the strictest interpretation of access controls, data classification, and asset management requirements.
SOC 2 Type II	Security, availability, confidentiality	Fabric's cloud infrastructure is SOC 2 Type II compliant. Air-gapped deployments inherit the security posture of your controlled environment.

Air-Gapped AI IDE: Secure AI Coding Without Internet Access

Why Air-Gapped Development Matters

Defense and Classified Environments

Regulatory Compliance

AI Sovereignty Concerns

Intellectual Property Protection

The Challenge: Cloud-Dependent AI IDEs

How Fabric Enables Air-Gapped AI Coding

Self-Hosted Models

Complete Offline Operation

Work Portability

Supported Models for Air-Gapped Deployment

The Performance Trade-Off

Deployment Architecture

Container-Based Deployment

Kubernetes Orchestration

VPC and Private Cloud

Compliance Framework Support

Frequently Asked Questions