Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: clone bare repositories without blobs #269

Merged
merged 2 commits into from
May 17, 2022
Merged

Conversation

tensor5
Copy link
Contributor

@tensor5 tensor5 commented May 5, 2022

In order to save disk space, make bare clones of repositories filtering out blobs.

Fixes #249

@tensor5
Copy link
Contributor Author

tensor5 commented May 12, 2022

Things to test:

  • CalculateRepoActivity works with bare repositories
  • the flag --filter=blob:none does not cause problems with repos using an older version of git.

In order to save disk space, make bare clones of repositories filtering
out blobs.
@tensor5 tensor5 marked this pull request as ready for review May 16, 2022 08:53
@bfabio
Copy link
Member

bfabio commented May 16, 2022

Once this is in, we'll need to remove old clones to make the crawler re-clone the repos from scratch, right?

@bfabio bfabio self-requested a review May 17, 2022 07:11
@bfabio bfabio merged commit 7238d10 into master May 17, 2022
@bfabio bfabio deleted the clone-bare-no-blobs branch May 17, 2022 07:22
bfabio added a commit to bfabio/publiccode-crawler that referenced this pull request Mar 3, 2023
bfabio added a commit to bfabio/publiccode-crawler that referenced this pull request Mar 3, 2023
bfabio added a commit to bfabio/publiccode-crawler that referenced this pull request Mar 3, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Don't clone all the repo contents when crawling
2 participants