Launching the Shell
Example Interaction
Features
Auto-completion
The shell provides auto-completion for available commands. Start typing/ and press Tab to see suggestions.
Conversation History
The shell automatically maintains conversation context, allowing for multi-turn dialogues. All previous messages are sent with each request to preserve context:Streaming Output
Responses are streamed token-by-token for a more interactive experience, similar to ChatGPT.Use Cases
Model Testing
Quickly test model behavior and quality before deploying to production
Prompt Engineering
Experiment with different prompts and conversation flows
Debugging
Verify model outputs and troubleshoot generation issues
Demonstrations
Show model capabilities in live demos and presentations
Limitations
- Shell mode does not support dummy weights (testing mode)
- History is stored in memory only and lost when the process exits
- No support for multi-user scenarios
- Single conversation thread only