Knowledge Contribution Details
You can create a Git repository to host your knowledge contributions anywhere (GitLab, Gerrit, etc.) but it might be favorable to create one on GitHub. The following instructions show you how to create a knowledge repository in GitHub and contribute to the taxonomy.
Prerequisites¶
- You have a GitHub account
- You have a forked copy of the taxonomy repository
- You have verified that the model does not already know the knowledge you want to submit
Creating your own knowledge repository¶
To create a new GitHub repository, follow the GitHub documentation in Creating a new repository.
The specific steps are listed as follows:
- In your GitHub profile page, navigate to the repositories tab. You will see a search bar where you can search your repositories or create a new one.
- This takes you to a page titled âCreate a new repositoryâ. Create a custom name for your repository and add a
README.md
file. For example, âknowlege_contributionsâ could be a good name for your repository. - Click âCreateâ when you are all set.
Convert your knowledge documentation to markdown¶
There are many online tools that can help you convert your documents to markdown. If you are using a wiki page for your contributions, you can use pandocs to convert the documents. For wikipedia sources on pandoc, use from: mediawiki
and convert to: markdown_strict
to access the proper markdown format.
Add the markdown file to your repository¶
To add a file to your GitHub repository, follow the GitHub documentation in Adding a file to a repository.
The specific steps are listed as follows:
- Navigate to âAdd filesâ. Click âCreate new fileâ if you want to manually add your markdown content. Click âUpload filesâ if you have a file locally to add.
-
Add a description and commit your changes.
Since this is your own repository, you can commit directly to the
main
branch. -
You can then see your new content in your repository.
Important
Make a note of your commit SHA; you'll need it for your qna.yaml
.
Create a pull request in the taxonomy repository¶
Navigate to your forked taxonomy repository and ensure it is up-to-date.
There are a few ways you can create a pull request:
- For details on the local process, check out The GitHub Workflow Guide in the Kubernetes documentation and the GitHub flow in the GitHub documentation.
- For details on contributing using the GitHub webpage UI, see Contributing using the GH UI or Creating a pull request in the GitHub documentation.
Verification¶
Here are a few things to check before seeking reviews for your contribution:
- Your
qna.yaml
follows the proper formatting. See examples in Knowledge: YAML examples - Ensure all parameters are set. Especially the
document
,repo
,commit
andpattern
keys; these parameters are specific to knowledge contributions and require more analysis. - Include an
attribution.txt
file for citing your sources. see For your attribution.txt file for more information.
PR Upstream Workflow¶
The following table outlines the expected timing for the PRs you have submitted. The PRs go through a few steps, and checks, but you should be able to map your label
to
the place that it is in.
Label | Actor | Action | Duration |
---|---|---|---|
Contributor | Submit PR | - | |
Contributor | Fix failed PR checks | - | |
https://github.com/instructlab/taxonomy/labels/triage-needed | Triager | Review PR, ask for changes | Days |
https://github.com/instructlab/taxonomy/labels/triage-requested-changes | Contributor | Make requested changes | Days |
https://github.com/instructlab/taxonomy/labels/precheck-generate-ready | Triager | Run prechecks and generate | Days |
https://github.com/instructlab/taxonomy/labels/community-build-ready | Backend | Model gets retrained | Weeks |
Triager | Check the numbers and PR merged or closed | - |