
# Sourcebot MCP - Fetch code context from GitHub, GitLab, Bitbucket, and more

[![Sourcebot](https://img.shields.io/badge/Website-sourcebot.dev-blue)](https://sourcebot.dev)
[![GitHub](https://img.shields.io/badge/GitHub-sourcebot--dev%2Fsourcebot-green?logo=github)](https://github.com/sourcebot-dev/sourcebot)
[![Docs](https://img.shields.io/badge/Docs-docs.sourcebot.dev-yellow)](https://docs.sourcebot.dev/docs/features/mcp-server)
[![npm](https://img.shields.io/npm/v/@sourcebot/mcp)](https://www.npmjs.com/package/@sourcebot/mcp)

The Sourcebot MCP server gives your LLM agents the ability to fetch code context across thousands of repos hosted on [GitHub](https://docs.sourcebot.dev/docs/connections/github), [GitLab](https://docs.sourcebot.dev/docs/connections/gitlab), [Bitbucket](https://docs.sourcebot.dev/docs/connections/bitbucket-cloud) and [more](#supported-code-hosts). Ask your LLM a question, and the Sourcebot MCP server will fetch relevant context from its index and inject it into your chat session. Some use cases this unlocks include:

- Enriching responses to user requests:
    - _"What repositories are using internal library X?"_
    - _"Provide usage examples of the CodeMirror component"_
    - _"Where is the `useCodeMirrorTheme` hook defined?"_
    - _"Find all usages of `deprecatedApi` across all repos"_

- Improving reasoning ability for existing horizontal agents like AI code review, docs generation, etc.
    - _"Find the definitions for all functions in this diff"_
    - _"Document what systems depend on this class"_

- Building custom LLM horizontal agents like compliance auditing agents, migration agents, etc.
    - _"Find all instances of hardcoded credentials"_
    - _"Identify repositories that depend on this deprecated API"_


## Getting Started

1. Install Node.js >= v18.0.0.

2. (optional) Spin up a Sourcebot instance by following [this guide](https://docs.sourcebot.dev/self-hosting/overview). The host URL of your instance (e.g., `http://localhost:3000`) is passed to the MCP server via the `SOURCEBOT_HOST` environment variable. This lets you control which repos Sourcebot MCP fetches context from (including private repos).

    If a host is not provided, the server falls back to the demo instance hosted at https://demo.sourcebot.dev. You can see the list of indexed repositories [here](https://demo.sourcebot.dev/~/repos). To add repositories to the demo instance, [open a PR](https://github.com/sourcebot-dev/sourcebot/blob/main/demo-site-config.json).

3. Install `@sourcebot/mcp` into your MCP client:

    <details>
    <summary>Cursor</summary>

    [Cursor MCP docs](https://docs.cursor.com/context/model-context-protocol)

    Go to: `Settings` -> `Cursor Settings` -> `MCP` -> `Add new global MCP server`

    Paste the following into your `~/.cursor/mcp.json` file. This will install Sourcebot globally within Cursor. The `env` block is optional; if `SOURCEBOT_HOST` is not specified, https://demo.sourcebot.dev is used:

    ```json
    {
        "mcpServers": {
            "sourcebot": {
                "command": "npx",
                "args": ["-y", "@sourcebot/mcp@latest"],
                "env": {
                    "SOURCEBOT_HOST": "http://localhost:3000"
                }
            }
        }
    }
    ```
    </details>

    <details>
    <summary>Windsurf</summary>

    [Windsurf MCP docs](https://docs.windsurf.com/windsurf/mcp)

    Go to: `Windsurf Settings` -> `Cascade` -> `Add Server` -> `Add Custom Server`

    Paste the following into your `mcp_config.json` file. The `env` block is optional; if `SOURCEBOT_HOST` is not specified, https://demo.sourcebot.dev is used:

    ```json
    {
        "mcpServers": {
            "sourcebot": {
                "command": "npx",
                "args": ["-y", "@sourcebot/mcp@latest"],
                "env": {
                    "SOURCEBOT_HOST": "http://localhost:3000"
                }
            }
        }
    }
    ```
    </details>

    <details>
    <summary>VS Code</summary>

    [VS Code MCP docs](https://code.visualstudio.com/docs/copilot/chat/mcp-servers)

    Add the following to your [.vscode/mcp.json](https://code.visualstudio.com/docs/copilot/chat/mcp-servers#_add-an-mcp-server-to-your-workspace) file:

    ```json
    {
        "servers": {
            "sourcebot": {
                "type": "stdio",
                "command": "npx",
                "args": ["-y", "@sourcebot/mcp@latest"],
                // Optional - if not specified, https://demo.sourcebot.dev is used
                "env": {
                    "SOURCEBOT_HOST": "http://localhost:3000"
                }
            }
        }
    }
    ```

    </details>

    <details>
    <summary>Claude Code</summary>

    [Claude Code MCP docs](https://docs.anthropic.com/en/docs/claude-code/tutorials#set-up-model-context-protocol-mcp)

    Run the following command:

    ```sh
    # SOURCEBOT_HOST env var is optional - if not specified,
    # https://demo.sourcebot.dev is used.
    claude mcp add sourcebot -e SOURCEBOT_HOST=http://localhost:3000 -- npx -y @sourcebot/mcp@latest
    ```
    </details>

    <details>
    <summary>Claude Desktop</summary>

    [Claude Desktop MCP docs](https://modelcontextprotocol.io/quickstart/user)

    Add the following to your `claude_desktop_config.json`. The `env` block is optional; if `SOURCEBOT_HOST` is not specified, https://demo.sourcebot.dev is used:

    ```json
    {
        "mcpServers": {
            "sourcebot": {
                "command": "npx",
                "args": ["-y", "@sourcebot/mcp@latest"],
                "env": {
                    "SOURCEBOT_HOST": "http://localhost:3000"
                }
            }
        }
    }
    ```
    </details>
    <br/>

    Alternatively, you can install via [Smithery](https://smithery.ai/server/@sourcebot-dev/sourcebot). For example:

    ```bash
    npx -y @smithery/cli install @sourcebot-dev/sourcebot --client claude
    ```

<br/>

4. Tell your LLM to `use sourcebot` when prompting.

<br/>

For a more detailed guide, check out [the docs](https://docs.sourcebot.dev/docs/features/mcp-server).
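
The host fallback described in step 2 can be sketched as follows. This illustrates the documented behavior only and is not the server's actual code: `resolve_host` is a hypothetical helper, and treating an empty value as unset and trimming trailing slashes are assumptions.

```python
import os

DEMO_HOST = "https://demo.sourcebot.dev"  # documented fallback instance


def resolve_host(env=None):
    """Return SOURCEBOT_HOST if set and non-empty, else the demo instance."""
    env = os.environ if env is None else env
    host = env.get("SOURCEBOT_HOST", "").strip()
    # Assumption: an empty value counts as "not provided", and trailing
    # slashes are trimmed so paths can be appended uniformly.
    return host.rstrip("/") if host else DEMO_HOST


print(resolve_host({}))
print(resolve_host({"SOURCEBOT_HOST": "http://localhost:3000/"}))
```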


## Available Tools

### search_code

Fetches code that matches the provided regex pattern in `query`.

**Temporal Filtering**: Use `since` and `until` to filter by repository index time (when Sourcebot last indexed the repo). This is different from commit time. See `search_commits` for commit-time filtering.

<details>
<summary>Parameters</summary>

| Name                  | Required | Description                                                                                                                       |
|:----------------------|:---------|:----------------------------------------------------------------------------------------------------------------------------------|
| `query`               | yes      | Regex pattern to search for. Escape special characters and spaces with a single backslash (e.g., 'console\.log', 'console\ log'). |
| `filterByRepoIds`     | no       | Restrict search to specific repository IDs (from 'list_repos'). Leave empty to search all.                                        |
| `filterByLanguages`   | no       | Restrict search to specific languages (GitHub linguist format, e.g., Python, JavaScript).                                         |
| `caseSensitive`       | no       | Case sensitive search (default: false).                                                                                           |
| `includeCodeSnippets` | no       | Include code snippets in results (default: false).                                                                                |
| `gitRevision`         | no       | Git revision to search (e.g., 'main', 'develop', 'v1.0.0'). Defaults to HEAD.                                                    |
| `since`               | no       | Only search repos indexed after this date. Supports ISO 8601 or relative (e.g., "30 days ago").                                   |
| `until`               | no       | Only search repos indexed before this date. Supports ISO 8601 or relative (e.g., "yesterday").                                    |
| `maxTokens`           | no       | Max tokens to return (default: env.DEFAULT_MINIMUM_TOKENS).                                                                       |
</details>
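
As a rough illustration of the `query` escaping rule, the sketch below builds a `search_code` arguments object from a literal string. `build_search_code_args` is a hypothetical helper, not part of `@sourcebot/mcp`. It assumes that `filterByLanguages` takes a list of strings and that Python's `re.escape` output is acceptable to the search backend.

```python
import json
import re


def build_search_code_args(literal, languages=None, since=None, include_snippets=False):
    """Build an arguments object for `search_code` from a literal string."""
    args = {
        # re.escape backslash-escapes regex metacharacters and spaces,
        # e.g. "console.log" -> "console\.log".
        "query": re.escape(literal),
        "includeCodeSnippets": include_snippets,
    }
    # Optional filters are omitted entirely when unset.
    if languages:
        args["filterByLanguages"] = list(languages)  # assumed to be a list
    if since:
        args["since"] = since  # filters by index time, not commit time
    return args


args = build_search_code_args("console.log", languages=["TypeScript"], since="30 days ago")
print(json.dumps(args, indent=2))
```

Omitting unset filters keeps the request aligned with the defaults listed in the table above.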


### list_repos

Lists repositories indexed by Sourcebot with optional filtering and pagination.

**Temporal Filtering**: Use `activeAfter` and `activeBefore` to filter by repository index time (when Sourcebot last indexed the repo). This is the same filtering behavior as `search_code`'s `since`/`until` parameters.

<details>
<summary>Parameters</summary>

| Name            | Required | Description                                                                                    |
|:----------------|:---------|:-----------------------------------------------------------------------------------------------|
| `query`         | no       | Filter repositories by name (case-insensitive).                                                |
| `pageNumber`    | no       | Page number (1-indexed, default: 1).                                                           |
| `limit`         | no       | Number of repositories per page (default: 50).                                                 |
| `activeAfter`   | no       | Only return repos indexed after this date. Supports ISO 8601 or relative (e.g., "30 days ago"). |
| `activeBefore`  | no       | Only return repos indexed before this date. Supports ISO 8601 or relative (e.g., "yesterday").  |

</details>
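
A client that needs every repository can walk the pages with `pageNumber` and `limit`. The sketch below is a minimal illustration under stated assumptions: `list_repos` is any callable you supply that invokes the tool and returns one page as a Python list (the tool's real response format is not specified here), and `fake_list_repos` is a stand-in used only to make the example runnable.

```python
def iter_all_repos(list_repos, query=None, limit=50):
    """Yield repos page by page until a short (or empty) page is returned."""
    page = 1  # pageNumber is 1-indexed
    while True:
        args = {"pageNumber": page, "limit": limit}
        if query:
            args["query"] = query
        repos = list_repos(args)
        yield from repos
        if len(repos) < limit:  # last page reached
            break
        page += 1


def fake_list_repos(args):
    """Stand-in for the real tool call: serves 120 made-up repo names."""
    names = [f"github.com/acme/repo-{i}" for i in range(120)]
    start = (args["pageNumber"] - 1) * args["limit"]
    return names[start:start + args["limit"]]


all_repos = list(iter_all_repos(fake_list_repos))
print(len(all_repos))
```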

### get_file_source

Fetches the source code for a given file.

<details>
<summary>Parameters</summary>

| Name         | Required | Description                                                      |
|:-------------|:---------|:-----------------------------------------------------------------|
| `fileName`   | yes      | The file to fetch the source code for.                           |
| `repoId`     | yes      | The Sourcebot repository ID.                                     |
</details>
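
MCP clients normally issue tool calls for you, but it can help to see what goes over the wire. The sketch below builds the JSON-RPC 2.0 `tools/call` request that MCP defines for invoking a tool. The `fileName` and `repoId` values are hypothetical placeholders; use whatever repository ID `list_repos` returned.

```python
import itertools
import json

_request_ids = itertools.count(1)  # JSON-RPC request ids


def tool_call_request(name, arguments):
    """Wrap a tool name and its arguments in an MCP `tools/call` request."""
    return {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }


request = tool_call_request(
    "get_file_source",
    # Placeholder values for illustration only.
    {"fileName": "README.md", "repoId": "github.com/sourcebot-dev/sourcebot"},
)
print(json.dumps(request, indent=2))
```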

### search_commits

Searches for commits in a specific repository based on actual commit time (NOT index time).

**Requirements**: Repository must be cloned on the Sourcebot server disk. Sourcebot automatically clones repositories during indexing, but the cloning process may not be finished when this query is executed. Use `list_repos` first to get the repository ID.

**Date Formats**: Supports ISO 8601 dates (e.g., "2024-01-01") and relative formats (e.g., "30 days ago", "last week", "yesterday").

<details>
<summary>Parameters</summary>

| Name       | Required | Description                                                                                    |
|:-----------|:---------|:-----------------------------------------------------------------------------------------------|
| `repoId`   | yes      | Repository identifier: either numeric database ID (e.g., 123) or full repository name (e.g., "github.com/owner/repo") as returned by `list_repos`. |
| `query`    | no       | Search query to filter commits by message (case-insensitive).                                  |
| `since`    | no       | Show commits after this date (by commit time). Supports ISO 8601 or relative formats.          |
| `until`    | no       | Show commits before this date (by commit time). Supports ISO 8601 or relative formats.         |
| `author`   | no       | Filter by author name or email (supports partial matches).                                     |
| `maxCount` | no       | Maximum number of commits to return (default: 50).                                             |

</details>

## Date Format Examples

All temporal parameters support:
- **ISO 8601**: `"2024-01-01"`, `"2024-12-31T23:59:59Z"`
- **Relative dates**: `"30 days ago"`, `"1 week ago"`, `"last month"`, `"yesterday"`

**Important**: Different tools filter by different time dimensions:
- `search_code` `since`/`until`: Filters by **index time** (when Sourcebot indexed the repo)
- `list_repos` `activeAfter`/`activeBefore`: Filters by **index time** (when Sourcebot indexed the repo)
- `search_commits` `since`/`until`: Filters by **commit time** (actual git commit dates)
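
To make the two accepted formats concrete, here is a small sketch of how such values could be interpreted. It is not Sourcebot's parser: `parse_when` is a hypothetical function that handles only ISO 8601 strings, `"yesterday"`, and `"N days ago"` / `"N weeks ago"` (not `"last month"`), and it assumes dates without a timezone are UTC.

```python
import re
from datetime import datetime, timedelta, timezone

_RELATIVE = re.compile(r"^(\d+)\s+(day|week)s?\s+ago$")


def parse_when(value, now=None):
    """Parse an ISO 8601 string or a simple relative date into a UTC datetime."""
    now = now or datetime.now(timezone.utc)
    text = value.strip().lower()
    if text == "yesterday":
        return now - timedelta(days=1)
    match = _RELATIVE.match(text)
    if match:
        amount, unit = int(match.group(1)), match.group(2)
        return now - timedelta(**{unit + "s": amount})
    # fromisoformat only accepts a trailing "Z" on Python 3.11+, so normalize it.
    parsed = datetime.fromisoformat(value.strip().replace("Z", "+00:00"))
    # Assumption: naive dates such as "2024-01-01" are treated as UTC.
    return parsed if parsed.tzinfo else parsed.replace(tzinfo=timezone.utc)


reference = datetime(2024, 6, 15, tzinfo=timezone.utc)
print(parse_when("30 days ago", reference))
print(parse_when("2024-12-31T23:59:59Z"))
```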


## Supported Code Hosts
Sourcebot supports the following code hosts:
- [GitHub](https://docs.sourcebot.dev/docs/connections/github)
- [GitLab](https://docs.sourcebot.dev/docs/connections/gitlab)
- [Bitbucket Cloud](https://docs.sourcebot.dev/docs/connections/bitbucket-cloud)
- [Bitbucket Data Center](https://docs.sourcebot.dev/docs/connections/bitbucket-data-center)
- [Gitea](https://docs.sourcebot.dev/docs/connections/gitea)
- [Gerrit](https://docs.sourcebot.dev/docs/connections/gerrit)

Don't see your code host? Open a [feature request](https://github.com/sourcebot-dev/sourcebot/issues/new?template=feature_request.md).

## Future Work

### Semantic Search

Currently, Sourcebot only supports regex-based code search (powered by [zoekt](https://github.com/sourcegraph/zoekt) under the hood). This works well when the agent is searching for something precise and well-represented in the source code (e.g., a specific function name or an error string). It works less well for _fuzzy_ searches where the objective is to find some loosely defined _category_ or _concept_ in the code (e.g., find code that verifies JWT tokens). The LLM can approximate this by crafting regex searches that attempt to capture a concept (e.g., it might try a query like `"jwt|token|(verify|validate).*(jwt|token)"`), but this often yields sub-optimal results that include unrelated code. Tools like Cursor solve this with [embedding models](https://docs.cursor.com/context/codebase-indexing) that capture the semantic meaning of code, allowing LLMs to search using natural language. We would like to extend Sourcebot to support semantic search and expose this capability over MCP as a tool (e.g., a `semantic_search_code` tool). See the [GitHub Discussion](https://github.com/sourcebot-dev/sourcebot/discussions/297).
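
The limitation is easy to reproduce locally. The sketch below runs the example pattern over a few made-up lines using Python's `re` module (standing in for the real search backend, with case-insensitive matching to mirror the `caseSensitive` default). Two of the three hits have nothing to do with JWT verification.

```python
import re

pattern = re.compile(r"jwt|token|(verify|validate).*(jwt|token)", re.IGNORECASE)

# Made-up source lines for illustration.
lines = [
    "const claims = jwt.verify(bearer, publicKey);",   # relevant: JWT verification
    "const csrfToken = req.headers['x-csrf-token'];",  # unrelated: CSRF token
    "tokens = tokenizer.tokenize(source)",             # unrelated: lexer tokens
    "function validateEmail(address) {",               # no match
]

hits = [line for line in lines if pattern.search(line)]
for line in hits:
    print(line)
```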