GitHub is a code hosting platform for version control and collaboration that lets people work on projects from anywhere. Connecting GitHub with Glean enables Glean to fetch relevant GitHub content. Glean reads user access permissions and strictly enforces them at query time, so users never see results they do not have access to. All data is stored in the GCP project in your cloud account, and no data leaves your environment.
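To make query-time permission enforcement concrete, here is a minimal Python sketch. It is illustrative only and does not describe Glean's actual implementation. The `Document` type, its `allowed_users` field, and the `filter_results` function are hypothetical names invented for this example.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Document:
    """Hypothetical indexed item; not a Glean data type."""
    title: str
    allowed_users: frozenset = field(default_factory=frozenset)


def filter_results(results, user):
    """Keep only the documents the querying user is allowed to read.

    The check is applied when the query is answered, not when the
    content is indexed.
    """
    return [doc for doc in results if user in doc.allowed_users]


# Example data (made up for illustration).
results = [
    Document("Fix login timeout", frozenset({"alice", "bob"})),
    Document("Rotate signing keys", frozenset({"alice"})),
]
visible_to_bob = [doc.title for doc in filter_results(results, "bob")]
```

In this sketch, `bob` sees only the first document, and a user who appears in no access list gets an empty result.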
Glean uses the standard API to capture relevant data. For GitHub, Glean captures the following content:
- PR descriptions
- PR conversations and comments
- Issue threads
- Commit messages for main branch
Additionally, Glean will capture the following from the latest commit on the main branch:
- Directory and file names
- Full content of documentation files only (.md and .txt)
Glean does not currently support code search.
If you use Glean Chat, Glean also indexes the following:
- Code contents from PR diffs
- Full content of all files from the latest commit on the main branch
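The file-level rules above can be summarized in a short Python sketch. It is a hedged illustration of the stated behavior, not Glean's code. The `indexed_fields` helper, its `chat_enabled` parameter, and the exact, case-sensitive extension match are assumptions made for this example.

```python
from pathlib import PurePosixPath

# Documentation file types named in the text above.
DOC_EXTENSIONS = {".md", ".txt"}


def indexed_fields(path, chat_enabled=False):
    """Return which parts of a main-branch file would be indexed.

    Hypothetical helper: the file name is always captured, and the
    full content is captured for documentation files, or for every
    file when Glean Chat is in use.
    """
    fields = ["name"]
    # Assumption: extensions are compared exactly (case-sensitive).
    is_doc = PurePosixPath(path).suffix in DOC_EXTENSIONS
    if is_doc or chat_enabled:
        fields.append("content")
    return fields


doc_fields = indexed_fields("docs/setup.md")
code_fields = indexed_fields("src/app.py")
code_fields_with_chat = indexed_fields("src/app.py", chat_enabled=True)
```

Under these assumptions, `docs/setup.md` is indexed by name and content, while `src/app.py` is indexed by name only unless Glean Chat is enabled.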
Engineering teams can find information to resolve an error by using Glean search to find previous cases with similar errors. They can then search through the comments, issue threads, and Slack discussions of those errors.
Users who have just started working on a project can benefit from the Collections feature in Glean, where teams can place files from several different platforms, such as GitHub and Google Drive, in a single folder.
Add custom data sources into Glean’s enterprise search software using our API to index application content.
Connect other sources of company knowledge, like your customer-facing help center or internal websites.
Glean can also search across workplace apps in your browser history, such as Airtable, Coda, Figma, and more.