# Shared ## Domain Types ### Cloud Astra DB Vector Store - `class CloudAstraDbVectorStore:` Cloud AstraDB Vector Store. This class is used to store the configuration for an AstraDB vector store, so that it can be created and used in LlamaCloud. Args: token (str): The Astra DB Application Token to use. api_endpoint (str): The Astra DB JSON API endpoint for your database. collection_name (str): Collection name to use. If not existing, it will be created. embedding_dimension (int): Length of the embedding vectors in use. keyspace (optional[str]): The keyspace to use. If not provided, 'default_keyspace' - `String token` The Astra DB Application Token to use - `String apiEndpoint` The Astra DB JSON API endpoint for your database - `String collectionName` Collection name to use. If not existing, it will be created - `long embeddingDimension` Length of the embedding vectors in use - `Optional className` - `Optional keyspace` The keyspace to use. If not provided, 'default_keyspace' - `Optional supportsNestedMetadataFilters` - `TRUE(true)` ### Cloud Az Storage Blob Data Source - `class CloudAzStorageBlobDataSource:` - `String accountUrl` The Azure Storage Blob account URL to use for authentication. - `String containerName` The name of the Azure Storage Blob container to read from. - `Optional accountKey` The Azure Storage Blob account key to use for authentication. - `Optional accountName` The Azure Storage Blob account name to use for authentication. - `Optional blob` The blob name to read from. - `Optional className` - `Optional clientId` The Azure AD client ID to use for authentication. - `Optional clientSecret` The Azure AD client secret to use for authentication. - `Optional prefix` The prefix of the Azure Storage Blob objects to read from. - `Optional supportsAccessControl` - `Optional tenantId` The Azure AD tenant ID to use for authentication. ### Cloud Azure AI Search Vector Store - `class CloudAzureAiSearchVectorStore:` Cloud Azure AI Search Vector Store. - `String searchServiceApiKey` - `String searchServiceEndpoint` - `Optional className` - `Optional clientId` - `Optional clientSecret` - `Optional embeddingDimension` - `Optional filterableMetadataFieldKeys` - `Optional indexName` - `Optional searchServiceApiVersion` - `Optional supportsNestedMetadataFilters` - `TRUE(true)` - `Optional tenantId` ### Cloud Box Data Source - `class CloudBoxDataSource:` - `AuthenticationMechanism authenticationMechanism` The type of authentication to use (Developer Token or CCG) - `DEVELOPER_TOKEN("developer_token")` - `CCG("ccg")` - `Optional className` - `Optional clientId` Box API key used for identifying the application the user is authenticating with - `Optional clientSecret` Box API secret used for making auth requests. - `Optional developerToken` Developer token for authentication if authentication_mechanism is 'developer_token'. - `Optional enterpriseId` Box Enterprise ID, if provided authenticates as service. - `Optional folderId` The ID of the Box folder to read from. - `Optional supportsAccessControl` - `Optional userId` Box User ID, if provided authenticates as user. ### Cloud Confluence Data Source - `class CloudConfluenceDataSource:` - `String authenticationMechanism` Type of Authentication for connecting to Confluence APIs. - `String serverUrl` The server URL of the Confluence instance. - `Optional apiToken` The API token to use for authentication. - `Optional className` - `Optional cql` The CQL query to use for fetching pages. - `Optional failureHandling` Configuration for handling failures during processing. Key-value object controlling failure handling behaviors. Example: { "skip_list_failures": true } Currently supports: - skip_list_failures: Skip failed batches/lists and continue processing - `Optional skipListFailures` Whether to skip failed batches/lists and continue processing - `Optional indexRestrictedPages` Whether to index restricted pages. - `Optional keepMarkdownFormat` Whether to keep the markdown format. - `Optional label` The label to use for fetching pages. - `Optional pageIds` The page IDs of the Confluence to read from. - `Optional spaceKey` The space key to read from. - `Optional supportsAccessControl` - `Optional userName` The username to use for authentication. ### Cloud Google Drive Data Source - `class CloudGoogleDriveDataSource:` - `String folderId` The ID of the Google Drive folder to read from. - `Optional className` - `Optional serviceAccountKey` A dictionary containing secret values - `Optional supportsAccessControl` ### Cloud Jira Data Source - `class CloudJiraDataSource:` Cloud Jira Data Source integrating JiraReader. - `String authenticationMechanism` Type of Authentication for connecting to Jira APIs. - `String query` JQL (Jira Query Language) query to search. - `Optional apiToken` The API/ Access Token used for Basic, PAT and OAuth2 authentication. - `Optional className` - `Optional cloudId` The cloud ID, used in case of OAuth2. - `Optional email` The email address to use for authentication. - `Optional serverUrl` The server url for Jira Cloud. - `Optional supportsAccessControl` ### Cloud Jira Data Source V2 - `class CloudJiraDataSourceV2:` Cloud Jira Data Source integrating JiraReaderV2. - `String authenticationMechanism` Type of Authentication for connecting to Jira APIs. - `String query` JQL (Jira Query Language) query to search. - `String serverUrl` The server url for Jira Cloud. - `Optional apiToken` The API Access Token used for Basic, PAT and OAuth2 authentication. - `Optional apiVersion` Jira REST API version to use (2 or 3). 3 supports Atlassian Document Format (ADF). - `_2("2")` - `_3("3")` - `Optional className` - `Optional cloudId` The cloud ID, used in case of OAuth2. - `Optional email` The email address to use for authentication. - `Optional expand` Fields to expand in the response. - `Optional> fields` List of fields to retrieve from Jira. If None, retrieves all fields. - `Optional getPermissions` Whether to fetch project role permissions and issue-level security - `Optional requestsPerMinute` Rate limit for Jira API requests per minute. - `Optional supportsAccessControl` ### Cloud Milvus Vector Store - `class CloudMilvusVectorStore:` Cloud Milvus Vector Store. - `String uri` - `Optional token` - `Optional className` - `Optional collectionName` - `Optional embeddingDimension` - `Optional supportsNestedMetadataFilters` ### Cloud MongoDB Atlas Vector Search - `class CloudMongoDBAtlasVectorSearch:` Cloud MongoDB Atlas Vector Store. This class is used to store the configuration for a MongoDB Atlas vector store, so that it can be created and used in LlamaCloud. Args: mongodb_uri (str): URI for connecting to MongoDB Atlas db_name (str): name of the MongoDB database collection_name (str): name of the MongoDB collection vector_index_name (str): name of the MongoDB Atlas vector index fulltext_index_name (str): name of the MongoDB Atlas full-text index - `String collectionName` - `String dbName` - `String mongoDBUri` - `Optional className` - `Optional embeddingDimension` - `Optional fulltextIndexName` - `Optional supportsNestedMetadataFilters` - `Optional vectorIndexName` ### Cloud Notion Page Data Source - `class CloudNotionPageDataSource:` - `String integrationToken` The integration token to use for authentication. - `Optional className` - `Optional databaseIds` The Notion Database Id to read content from. - `Optional pageIds` The Page ID's of the Notion to read from. - `Optional supportsAccessControl` ### Cloud One Drive Data Source - `class CloudOneDriveDataSource:` - `String clientId` The client ID to use for authentication. - `String clientSecret` The client secret to use for authentication. - `String tenantId` The tenant ID to use for authentication. - `String userPrincipalName` The user principal name to use for authentication. - `Optional className` - `Optional folderId` The ID of the OneDrive folder to read from. - `Optional folderPath` The path of the OneDrive folder to read from. - `Optional> requiredExts` The list of required file extensions. - `Optional supportsAccessControl` - `TRUE(true)` ### Cloud Pinecone Vector Store - `class CloudPineconeVectorStore:` Cloud Pinecone Vector Store. This class is used to store the configuration for a Pinecone vector store, so that it can be created and used in LlamaCloud. Args: api_key (str): API key for authenticating with Pinecone index_name (str): name of the Pinecone index namespace (optional[str]): namespace to use in the Pinecone index insert_kwargs (optional[dict]): additional kwargs to pass during insertion - `String apiKey` The API key for authenticating with Pinecone - `String indexName` - `Optional className` - `Optional insertKwargs` - `Optional namespace` - `Optional supportsNestedMetadataFilters` - `TRUE(true)` ### Cloud Postgres Vector Store - `class CloudPostgresVectorStore:` - `String database` - `long embedDim` - `String host` - `String password` - `long port` - `String schemaName` - `String tableName` - `String user` - `Optional className` - `Optional hnswSettings` HNSW settings for PGVector. - `Optional distanceMethod` The distance method to use. - `L2("l2")` - `IP("ip")` - `COSINE("cosine")` - `L1("l1")` - `HAMMING("hamming")` - `JACCARD("jaccard")` - `Optional efConstruction` The number of edges to use during the construction phase. - `Optional efSearch` The number of edges to use during the search phase. - `Optional m` The number of bi-directional links created for each new element. - `Optional vectorType` The type of vector to use. - `VECTOR("vector")` - `HALF_VEC("half_vec")` - `BIT("bit")` - `SPARSE_VEC("sparse_vec")` - `Optional hybridSearch` - `Optional performSetup` - `Optional supportsNestedMetadataFilters` ### Cloud Qdrant Vector Store - `class CloudQdrantVectorStore:` Cloud Qdrant Vector Store. This class is used to store the configuration for a Qdrant vector store, so that it can be created and used in LlamaCloud. Args: collection_name (str): name of the Qdrant collection url (str): url of the Qdrant instance api_key (str): API key for authenticating with Qdrant max_retries (int): maximum number of retries in case of a failure. Defaults to 3 client_kwargs (dict): additional kwargs to pass to the Qdrant client - `String apiKey` - `String collectionName` - `String url` - `Optional className` - `Optional clientKwargs` - `Optional maxRetries` - `Optional supportsNestedMetadataFilters` - `TRUE(true)` ### Cloud S3 Data Source - `class CloudS3DataSource:` - `String bucket` The name of the S3 bucket to read from. - `Optional awsAccessId` The AWS access ID to use for authentication. - `Optional awsAccessSecret` The AWS access secret to use for authentication. - `Optional className` - `Optional prefix` The prefix of the S3 objects to read from. - `Optional regexPattern` The regex pattern to filter S3 objects. Must be a valid regex pattern. - `Optional s3EndpointUrl` The S3 endpoint URL to use for authentication. - `Optional supportsAccessControl` ### Cloud Sharepoint Data Source - `class CloudSharepointDataSource:` - `String clientId` The client ID to use for authentication. - `String clientSecret` The client secret to use for authentication. - `String tenantId` The tenant ID to use for authentication. - `Optional className` - `Optional driveName` The name of the Sharepoint drive to read from. - `Optional> excludePathPatterns` List of regex patterns for file paths to exclude. Files whose paths (including filename) match any pattern will be excluded. Example: ['/temp/', '/backup/', '.git/', '.tmp$', '^~'] - `Optional folderId` The ID of the Sharepoint folder to read from. - `Optional folderPath` The path of the Sharepoint folder to read from. - `Optional getPermissions` Whether to get permissions for the sharepoint site. - `Optional> includePathPatterns` List of regex patterns for file paths to include. Full paths (including filename) must match at least one pattern to be included. Example: ['/reports/', '/docs/.*.pdf$', '^Report.*.pdf$'] - `Optional> requiredExts` The list of required file extensions. - `Optional siteId` The ID of the SharePoint site to download from. - `Optional siteName` The name of the SharePoint site to download from. - `Optional supportsAccessControl` - `TRUE(true)` ### Cloud Slack Data Source - `class CloudSlackDataSource:` - `String slackToken` Slack Bot Token. - `Optional channelIds` Slack Channel. - `Optional channelPatterns` Slack Channel name pattern. - `Optional className` - `Optional earliestDate` Earliest date. - `Optional earliestDateTimestamp` Earliest date timestamp. - `Optional latestDate` Latest date. - `Optional latestDateTimestamp` Latest date timestamp. - `Optional supportsAccessControl` ### Failure Handling Config - `class FailureHandlingConfig:` Configuration for handling different types of failures during data source processing. - `Optional skipListFailures` Whether to skip failed batches/lists and continue processing ### Pg Vector Hnsw Settings - `class PgVectorHnswSettings:` HNSW settings for PGVector. - `Optional distanceMethod` The distance method to use. - `L2("l2")` - `IP("ip")` - `COSINE("cosine")` - `L1("l1")` - `HAMMING("hamming")` - `JACCARD("jaccard")` - `Optional efConstruction` The number of edges to use during the construction phase. - `Optional efSearch` The number of edges to use during the search phase. - `Optional m` The number of bi-directional links created for each new element. - `Optional vectorType` The type of vector to use. - `VECTOR("vector")` - `HALF_VEC("half_vec")` - `BIT("bit")` - `SPARSE_VEC("sparse_vec")`