Data Agent User Guide
The Data Agent is Simba Intelligence’s AI-powered tool for automatically creating data sources. Instead of manually configuring database connections and schemas, you simply describe what you need or upload a screenshot of your desired dashboard, and the AI agent handles the technical setup for you.Overview
What is the Data Agent?
Think of the Data Agent as your intelligent data assistant. Just as you might describe a project to a colleague and have them set up the necessary resources, the Data Agent listens to your data requirements and automatically:- Analyzes your needs from text descriptions or dashboard images
- Connects to your databases using available data connections
- Creates optimized data sources ready for querying
- Configures relationships and fields based on your requirements
When to Use the Data Agent
Perfect for:- Creating new data sources for dashboards or reports
- Prototyping data connections quickly
- Converting dashboard mockups into working data sources
- Exploring new databases without manual schema analysis
- “I need to analyze our sales performance by region and product category”
- “Create a data source for this customer dashboard mockup”
- “Set up connections to track inventory levels across our warehouses”
Prerequisites
Before using the Data Agent, ensure you have:1. Required Permissions
- ROLE_CREATE_SOURCES permission or higher
- Access to the Data Agent interface (typically at
/data-source-agent)
2. LLM Configuration
- At least one AI provider configured (Google Vertex AI, OpenAI, or AWS Bedrock)
- Active AI provider with sufficient quota/credits
3. Data Connections
- At least one data connection configured and available
- Network connectivity from Simba Intelligence to your target databases
- Appropriate database permissions for the data connection credentials
💡 Tip: If you don’t have these prerequisites, the Data Agent will guide you through the setup process or direct you to the appropriate configuration pages.
Getting Started
Step 1: Access the Data Agent
- Navigate to the Data Agent:
- Go to the main Simba Intelligence interface
- Click “Data Agent” in the navigation menu
- Or visit directly:
http://your-domain/data-source-agent
- Verify prerequisites:
- The system automatically checks for LLM configuration
- If missing, you’ll be prompted to configure an AI provider first
Step 2: Initial Setup Check
When you first access the Data Agent, it performs automatic validation:- ✅ LLM Provider: Confirms AI services are configured and accessible
- ✅ Data Connections: Verifies available database connections
- ✅ Permissions: Ensures you have rights to create data sources
Creating Data Sources
The Data Agent supports two primary input methods for understanding your requirements:Method 1: Text Description
Best for: Clear data requirements, existing knowledge of your data structure- Click “Get Started” on the Data Agent welcome screen
- Describe your data needs in the text area (optional). Be specific about:
- What data you want to analyze
- Key metrics or fields of interest
- Intended use case or analysis goals
Method 2: Image Analysis (Dashboard Mockups)
Best for: Visual dashboard designs, mockups, screenshots of desired outputs- Upload an image of your desired dashboard or data visualization
- Supported image formats: PNG, JPG, JPEG (max 10MB)
- Drag and drop or click to browse for your image file
- Add context (optional): Include additional text description to clarify requirements
- Chart types and visualizations
- Data labels and field names
- Filters and interactive elements
- Layout and grouping patterns
- Implied data relationships
🔍 Image Analysis Tip: The AI works best with clear, well-labeled dashboard mockups. Hand-drawn sketches work too, as long as labels and relationships are visible.
Method 3: Combined Approach
For best results, combine both methods:- Upload a dashboard mockup or screenshot
- Add text description with additional context and requirements
- Specify any particular data sources or constraints
The AI Agent Workflow
Once you provide your requirements, the Data Agent follows an intelligent workflow:Phase 1: Requirement Analysis
What happens:- AI processes your text description and/or image
- Identifies key data entities, relationships, and metrics
- Determines optimal data source structure
- Real-time analysis progress updates
- Extracted requirements summary
- Identified data patterns and relationships
Phase 2: Connection Selection
What happens:- System presents available data connections
- You select which database connection to use
Phase 3: Data Source Creation
What happens:- AI agent connects to selected database
- Analyzes schema and available data
- Creates optimized data source configuration
- Tests queries and validates data access
- Database connection establishment
- Schema analysis progress
- Query generation and testing
- Data source optimization
Phase 4: Results and Next Steps
What happens:- Data source is created and ready for use
- System provides access options
- Suggests next steps for data exploration
Understanding AI Agent Progress
The Data Agent provides detailed, real-time feedback during the creation process:Progress Event Types
📋 Info Events: General progress updates and status information 🚀 Start Events: Beginning of major workflow phases ✅ Complete Events: Successful completion of tasks ⚠️ Warning Events: Non-critical issues or suggestions ❌ Error Events: Problems requiring attentionTypical Progress Flow
- “Analyzing your requirements…”
- Processing text description or image
- Identifying key data elements
- “Connecting to database…”
- Establishing connection via selected data connection
- Validating access permissions
- “Exploring database schema…”
- Discovering available tables and relationships
- Mapping to your requirements
- “Creating data source configuration…”
- Building optimized queries and views
- Setting up field mappings and relationships
- “Testing and validating…”
- Running test queries
- Verifying data accessibility
- “Data source created successfully!”
- Final configuration complete
- Ready for querying
Working with Results
When Data Source Creation Succeeds
You’ll receive:- ✅ Confirmation message with data source details
- 🔗 Direct links to view or query the new data source
- 📊 Summary of created fields and relationships
- “Open Created Data Source”
- Takes you to the Data Sources management page
- Shows your new data source highlighted
- Allows for manual configuration
- “Go to Playground”
- Opens the Playground interface with your data source pre-selected
- Ready for immediate natural language querying
- Perfect for testing and validation
If Creation Encounters Issues
Common scenarios and solutions: Permission Issues:- Verify database user has appropriate read permissions
- Check that connection configuration is correct
- Contact your database administrator if needed
- AI may need more specific guidance for complex databases
- Try breaking down requirements into smaller, focused data sources
- Consider creating multiple related data sources
- Ensure network connectivity between Simba Intelligence and target database
- Verify firewall rules allow database connections
- Check that database server is accessible and running
Best Practices
Writing Effective Descriptions
✅ Do:- Be specific about metrics and dimensions you need
- Mention intended use cases (reports, dashboards, analysis)
- Include relevant business context
- Specify time ranges or filters if important
- Vague requests like “connect to our database”
- Technical jargon without business context
- Overly complex requirements in a single request
- Assumptions about database structure
Image Upload Guidelines
For best results:- Use clear, well-labeled mockups or screenshots
- Ensure text in images is readable
- Include legends and axis labels
- Show the full dashboard or visualization context
- Use high-contrast images (dark text on light background)
Managing Complex Requirements
Break down large projects:- Start with core metrics and entities
- Create foundational data sources first
- Build additional sources for related data
- Combine sources in the Playground interface
- Create a basic data source first
- Test with sample queries
- Refine or create additional sources as needed
Advanced Features
Image Analysis Details
When you upload an image, the AI performs sophisticated analysis: Visual Component Recognition:- Chart types (bar, line, pie, tables, etc.)
- Data labels and field names
- Filter controls and parameters
- Layout and grouping structures
- Dashboard overview and purpose
- Individual visual components
- Required filters and parameters
- Data source recommendations
- Implementation suggestions
Context Combination
The most powerful approach combines multiple inputs:Data Source Refinement
After creation, you can:- Modify field selections through the Data Sources interface
- Adjust relationships between tables
- Add calculated fields or custom metrics
- Configure security filters for different user groups
Troubleshooting Common Issues
”No LLM Configuration Found”
Problem: AI provider not configured or accessible Solution:- Navigate to LLM Configuration page
- Add at least one AI provider (Vertex AI, OpenAI, etc.)
- Verify credentials and test connection
- Return to Data Agent
”No Data Connections Available”
Problem: No database connections configured Solution:- Go to Data Connections page
- Add connection to your database (SQL Server, PostgreSQL, Snowflake)
- Test connectivity
- Return to Data Agent
”Data Agent Execution Failed”
Possible causes and solutions: Network connectivity:- Check database server is accessible
- Verify firewall rules and network configuration
- Ensure connection has proper network access
- Verify connection user has SELECT permissions on required tables
- Check database user exists and password is correct
- Ensure database allows connections from Simba Intelligence
- AI provider may be rate-limited or out of quota
- Database may be under heavy load
- Check system resources and retry
”Incomplete Data Source Configuration”
Problem: AI couldn’t fully analyze or configure requirements Solution:- Review and simplify your description
- Try breaking complex requirements into smaller pieces
- Ensure uploaded images are clear and well-labeled
- Provide additional context or constraints
Security and Governance
Data Access Control
The Data Agent respects existing security configurations:- Database permissions: Only accesses data the connector user can read
- User roles: Honors your Simba Intelligence permission level
- Data source security: Applies any configured row-level or column-level security
Audit and Compliance
All Data Agent activities are logged:- Creation requests: Who created what data source and when
- AI interactions: Queries sent to AI providers (without sensitive data)
- Database access: What tables and schemas were analyzed
- Success/failure events: Complete audit trail of all operations
Privacy Considerations
Data processing:- Text descriptions are sent to AI providers for analysis
- Database schema information (table/column names) may be shared with AI
- Actual data values are never sent to external AI services
- Image uploads are processed by AI vision services
- All AI provider communications use encrypted connections
- Credentials and sensitive configuration stored securely
- Database connections use provided authentication
Integration with Other Features
Using Created Data Sources
After creating a data source with the Data Agent: In the Playground:- Natural language querying against your data
- Real-time results with explanations
- Export capabilities for further analysis
- Fine-tune field mappings and relationships
- Configure additional security or performance settings
- Monitor usage and performance metrics
- Programmatic access to your created data sources
- Integration with custom applications
- Automated query execution and result processing
Combining Multiple Data Sources
Create related data sources and combine them:- Use Data Agent to create core data source
- Create additional sources for related data
- Use Playground to query across multiple sources
- Build comprehensive analyses combining different data sets
Tips for Success
Getting Better Results
- Start simple: Create basic data sources before tackling complex requirements
- Iterate: Use initial results to refine and create additional sources
- Test immediately: Verify data sources work as expected using Playground
- Document requirements: Keep notes on successful patterns for future use
Performance Optimization
- Focus requirements: Specific, focused data sources perform better than broad ones
- Use appropriate data connection: Match data source to optimal database connection
- Monitor query performance: Check how created sources perform in practice
- Refine as needed: Adjust data source configuration based on actual usage
Collaboration
- Share successful patterns: Document effective description templates
- Coordinate data connection usage: Ensure team members don’t overwhelm database connections
- Review created sources: Have team members validate AI-generated configurations
- Build incrementally: Create foundational sources that others can extend
Ready to create your first AI-powered data source? Head to the Data Agent and describe what you need. The AI will handle the technical details while you focus on your analysis goals.

