Add new field extracted_to to CodebaseResource #510
Comments
Signed-off-by: Jono Yang <[email protected]>
Hey, can I work on this?
@TG1999 ^^
You sure can work on this! You would need to get familiar with how extractcode works and how extraction works in ScanCode.io. It extracts files to a directory. The extracted_to field would keep track of the directory an archive is extracted to.
Closed in favor of #827
In #485, we have an issue where we get two DiscoveredPackages for the same package when we scan a PyPI wheel using the scan_codebase pipeline. This happens because we report a Package detected from the wheel itself, and then create another Package from the METADATA file extracted from the wheel. One way to avoid this would be for ScanCode.io to know where archives were extracted to. That way, if we detect that an archive is a Package, we can easily tag its extracted contents as being part of that package. Alternatively, if we detect that an extracted archive is itself a package, we can easily tag the archive as part of that package.
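To make the proposal concrete, here is a rough sketch of how an extracted_to value could be used to tag an archive and its extracted contents with a single package. It is not ScanCode.io code: the Resource class, the for_packages list, the tag_extracted_contents helper, and the sample paths and package id are all hypothetical stand-ins chosen for illustration.

```python
from dataclasses import dataclass, field
from pathlib import PurePosixPath
from typing import List, Optional


@dataclass
class Resource:
    """Hypothetical stand-in for a CodebaseResource (not the real model)."""
    path: str
    # Proposed field: directory the archive was extracted to, None for non-archives.
    extracted_to: Optional[str] = None
    # Hypothetical list of package identifiers this resource belongs to.
    for_packages: List[str] = field(default_factory=list)


def tag_extracted_contents(resources, archive_path, package_id):
    """Tag the archive and everything under its extraction directory with package_id.

    Returns the paths of the tagged resources.
    """
    by_path = {r.path: r for r in resources}
    archive = by_path[archive_path]
    tagged = [archive]
    if archive.extracted_to:
        root = PurePosixPath(archive.extracted_to)
        # A resource is "extracted content" if the extraction dir is one of its parents.
        tagged += [r for r in resources if root in PurePosixPath(r.path).parents]
    for r in tagged:
        if package_id not in r.for_packages:
            r.for_packages.append(package_id)
    return [r.path for r in tagged]


# Illustrative data only: a wheel, its extracted METADATA file, and an unrelated file.
resources = [
    Resource("foo-1.0-py3-none-any.whl", extracted_to="foo-1.0-py3-none-any.whl-extract"),
    Resource("foo-1.0-py3-none-any.whl-extract/foo-1.0.dist-info/METADATA"),
    Resource("README.txt"),
]
tagged_paths = tag_extracted_contents(resources, "foo-1.0-py3-none-any.whl", "pkg:pypi/foo@1.0")
```

With the wheel and its METADATA file attached to the same package identifier, a later step could skip creating a second package from METADATA, which is the duplication described in #485.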