Add new field extracted_to to CodebaseResource #510


Closed
JonoYang opened this issue Aug 23, 2022 · 4 comments

Comments

@JonoYang
Member

In #485, we have an issue where we get two DiscoveredPackages for the same package when we scan a PyPI wheel using the scan_codebase pipeline. This happens because we report one Package detected from the wheel itself, and then create another Package from the METADATA file extracted from the wheel. A way to avoid this would be for ScanCode.io to know where archives were extracted to. That way, if we detect that an archive is a Package, we can easily tag its extracted contents as being part of that package. Alternatively, if we detect that an extracted archive is itself a package, we can easily tag the archive as part of that package.
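
To illustrate the idea, here is a minimal sketch in plain Python. It does not use the real ScanCode.io models: the `Resource` class, the `tag_extracted_contents` helper, and the example paths are all hypothetical, and `extracted_to` is modeled as a simple path string purely as an assumption.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Resource:
    """Simplified stand-in for a CodebaseResource (hypothetical, not the real model)."""

    path: str
    # Hypothetical field proposed in this issue: where the archive was extracted.
    extracted_to: Optional[str] = None
    packages: set = field(default_factory=set)


def tag_extracted_contents(resources, archive_path, package_id):
    """Tag the archive and everything under its extraction directory with package_id."""
    by_path = {resource.path: resource for resource in resources}
    archive = by_path[archive_path]
    archive.packages.add(package_id)
    if not archive.extracted_to:
        return
    # Match only paths strictly below the extraction directory.
    prefix = archive.extracted_to.rstrip("/") + "/"
    for resource in resources:
        if resource.path.startswith(prefix):
            resource.packages.add(package_id)


# Illustrative data only: paths and package identifier are made up.
resources = [
    Resource("foo-1.0-py3-none-any.whl", extracted_to="foo-1.0-py3-none-any.whl-extract"),
    Resource("foo-1.0-py3-none-any.whl-extract/foo-1.0.dist-info/METADATA"),
    Resource("unrelated/README.txt"),
]
tag_extracted_contents(resources, "foo-1.0-py3-none-any.whl", "pkg:pypi/foo@1.0")
```

With this link in place, the METADATA file ends up attached to the package already detected from the wheel, so there would be no need to create a second package for it.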

JonoYang added a commit that referenced this issue Aug 24, 2022
JonoYang added a commit that referenced this issue Aug 24, 2022
@uzaxirr

uzaxirr commented Jan 19, 2023

Hey, can I work on this?
Also, could you please describe what the extracted_to field would look like? Would it be a ForeignKey or something else?

@uzaxirr

uzaxirr commented Jan 28, 2023

@TG1999 ^^

@pombredanne
Member

You sure can work on this! You would need to get familiar with how extractcode works and how extraction works in ScanCode.io. It extracts files to a directory. The extracted_to field would be about keeping track of which directory an archive is extracted to.
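
As a rough sketch of what "keeping track" could mean, the snippet below pairs each archive path with its extraction directory. It assumes extraction directories are named after the archive plus an `-extract` suffix; that convention, the `map_archives_to_extraction_dirs` helper, and the sample paths are assumptions for illustration, not the actual ScanCode.io implementation.

```python
EXTRACT_SUFFIX = "-extract"  # assumed naming convention for extraction directories


def map_archives_to_extraction_dirs(paths):
    """Return {archive_path: extraction_dir} for each path that has a '<path>-extract' sibling."""
    all_paths = set(paths)
    mapping = {}
    for path in all_paths:
        candidate = path + EXTRACT_SUFFIX
        if candidate in all_paths:
            mapping[path] = candidate
    return mapping


# Illustrative codebase listing; directories are included as their own entries.
paths = [
    "a.whl",
    "a.whl-extract",
    "a.whl-extract/METADATA",
    "b.txt",
]
extracted_to = map_archives_to_extraction_dirs(paths)
```

Whether the value is stored as a path string or as a relation to another resource is left open here.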

@pombredanne
Member

Closed in favor of #827
