-
Notifications
You must be signed in to change notification settings - Fork 4.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
adds dependency analysis #18543
adds dependency analysis #18543
Conversation
graph still looks like you're cutting at github/org not github/org/repo |
2437936
to
400edcd
Compare
remove all |
tools/depcheck/pkg/cmd/trace/pkg.go
Outdated
@@ -91,7 +91,10 @@ func BuildGraph(packages *PackageList, excludes []string) (*depgraph.MutableDire | |||
|
|||
to, exists := g.NodeByName(dependency) | |||
if !exists { | |||
return nil, fmt.Errorf("expected child node for dependency %q was not found in graph", dependency) | |||
// if a package imports a dependency that we did not visit |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this was supposed to end up in a separate pull. did I miss it?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Opened a pull here to begin addressing this #18606
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Opened a pull here to begin addressing this #18606
whoa, that pull is big. I was expecting it to be 5 lines long.
tools/depcheck/pkg/cmd/trace/pkg.go
Outdated
@@ -108,6 +111,55 @@ func BuildGraph(packages *PackageList, excludes []string) (*depgraph.MutableDire | |||
return g, nil | |||
} | |||
|
|||
func copyGraph(g *depgraph.MutableDirectedGraph) (*depgraph.MutableDirectedGraph, error) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
weird
return nil | ||
} | ||
|
||
func printYours(g *depgraph.MutableDirectedGraph, roots, nodes []graph.Node, you *depgraph.Node) ([]graph.Node, error) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You have a print method that is mutating its input?? Don't do that.
from := g.From(n) | ||
|
||
// found an existing orphaned node to exclude | ||
if len(to) == 0 && len(from) == 0 { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
When does this happen? I would expect these to snipped before you call this method.
g.RemoveNode(you) | ||
|
||
// find nodes not reachable from our given roots | ||
yours := findOrphans(g, roots, nodes) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Given a clean starting graph, the idea of "roots" doesn't exist.
for _, n := range nodes { | ||
isOrphan := true | ||
for _, root := range roots { | ||
if hasPathFromTo(g, root, n) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This doesn't look right. If you started your graph without orphans, then any orphans you're finding later on the result of the removing the "from" on an inbound link.
return nil | ||
} | ||
|
||
func printYours(g *depgraph.MutableDirectedGraph, roots, nodes []graph.Node, you *depgraph.Node) ([]graph.Node, error) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The logic in here looks wrong. I expected you to produce a new graph by filtering out a selected set of nodes. Then after you've built your new graph, you simply scan all nodes to see if they don't have any inbound links. You track those, then built a new graph using the nodes without any inbound links as your input. Repeat until you have no changes. Then union all the nodes you removed.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Notice that at no point do we discuss "roots". I'm still trying to figure out why that concept is needed.
c6fadf7
to
45b85cd
Compare
tools/depcheck/pkg/graph/graph.go
Outdated
@@ -55,7 +55,64 @@ func (g *MutableDirectedGraph) AddNode(n *Node) error { | |||
return nil | |||
} | |||
|
|||
func (g *MutableDirectedGraph) RemoveNode(n *Node) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
godoc whether this leaves dangling edges
tools/depcheck/pkg/graph/graph.go
Outdated
func (g *MutableDirectedGraph) NodeByName(name string) (graph.Node, bool) { | ||
n, exists := g.nodesByName[name] | ||
return n, exists && g.DirectedGraph.Has(n) | ||
} | ||
|
||
// remove nodes with no inbound edges | ||
func (g *MutableDirectedGraph) PruneOrphans() { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Return a list of the nodes you removed.
// - "Yours": a list of every node in the set unique to the dependency tree of a given target node. | ||
// - "Mine": a list of every node in the set unique to the dependency tree of the root nodes (set non-overlapping with given target node) | ||
// - "Ours": a list of every node in the overlapping set between the dependency tree of the root nodes and a given target node | ||
func (o *TraceImportsOpts) analyzeGraph(g *depgraph.MutableDirectedGraph) error { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is really hard to read. Separate your library methods (things operating on input data), from your command wiring (things combining a set of library methods). I should have a small method for identifying "which repos are only present because this set of nodes is present".
// This operation is done recursively until we no longer cause | ||
// any nodes to become orphaned after pruning the removed set. | ||
// The resulting set of total nodes recursively removed is returned. | ||
func calculateOrphans(g *depgraph.MutableDirectedGraph, targetNodes []*depgraph.Node) []*depgraph.Node { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this is the graph method I want. Peer to the other graph methods I'd think. Unit tests located there to check it.
continue | ||
} | ||
|
||
orphans = append(orphans, n.(*depgraph.Node)) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Isn't this exactly what your PruneOrphans method does? I didn't look, but for that method to work it would have to be recursive. It is recursive, right? If not, write the test that will cause that method to break and then fix it.
// This operation is done recursively until we no longer cause | ||
// any nodes to become orphaned after pruning the removed set. | ||
// The resulting set of total nodes recursively removed is returned. | ||
func calculateOrphans(g *depgraph.MutableDirectedGraph, targetNodes []*depgraph.Node) []*depgraph.Node { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Bad name. This is FindExclusiveDependencies(g, targetNodes)
} | ||
|
||
type TraceImportsFlags struct { | ||
Roots []string |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this should be dead, rigth?
excludes []string | ||
filters []string | ||
|
||
Analyze string |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
bad name, what is this?
` | ||
|
||
type TraceImportsOpts struct { | ||
Roots []string |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
dead I hope
fmt.Printf("Analyzing a total of %v packages\n", len(g.Nodes())) | ||
fmt.Println() | ||
|
||
yours := calculateOrphans(g, []*depgraph.Node{you}) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why are you messing with a single repo/node. That doesn't make sense for what this needs to do.
} | ||
|
||
cmd.Flags().StringSliceVar(&flags.Roots, "root", flags.Roots, "set of entrypoints for dependency trees used to generate a depedency graph.") | ||
cmd.Flags().StringVarP(&flags.Exclude, "exclude", "e", "", "json file containing a list of import-paths of packages to recursively exclude when traversing the set of given entrypoints specified through --root.") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
description sounds bad. I'd want to choose recursive exclusion or not based on ...
in golang, right?
cmd.Flags().StringVarP(&flags.Exclude, "exclude", "e", "", "json file containing a list of import-paths of packages to recursively exclude when traversing the set of given entrypoints specified through --root.") | ||
cmd.Flags().StringVarP(&flags.OutputFormat, "output", "o", "", "output generated dependency graph in specified format. One of: dot.") | ||
cmd.Flags().StringVarP(&flags.Collapse, "collapse", "c", "", "json file containing a list of import-paths of packages to collapse sub-packages into.") | ||
cmd.Flags().StringVarP(&flags.Analyze, "analyze", "a", "", "output a summary report on the dependency set of a given repository. Mutually exclusive with --output") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
if its mutually exclusive, why do we have this lever? Is this just a different output mode or is it (more likely) a different command.
}, | ||
} | ||
|
||
cmd.Flags().StringSliceVar(&flags.Roots, "root", flags.Roots, "set of entrypoints for dependency trees used to generate a depedency graph.") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It really seems like you ought to have a set of common flags for building the graph, then a separate set of flags for analyzing it and that you're re-use the flags for building the graph in multiple different commands.
@deads2k Went ahead and rebased with approved changes from #18466. Added the analysis bits on top. |
/retest |
/test extended_networking_minimal |
prereq merged. rebase for smaller diff |
36b30fa
to
039582b
Compare
tools/depcheck/pkg/cmd/analyze.go
Outdated
` | ||
|
||
type AnalyzeOptions struct { | ||
*graph.GraphOptions |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
no anonymous inclusion
tools/depcheck/pkg/cmd/analyze.go
Outdated
type AnalyzeOptions struct { | ||
*graph.GraphOptions | ||
|
||
// Packages to analyze against |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
against?
tools/depcheck/pkg/cmd/analyze.go
Outdated
} | ||
|
||
type AnalyzeFlags struct { | ||
*graph.GraphFlags |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
no anonymous inclusion
tools/depcheck/pkg/cmd/analyze.go
Outdated
// - "Yours": a list of every node in the set unique to the dependency tree of a given target node. | ||
// - "Mine": a list of every node in the set unique to the dependency tree of the root nodes (set non-overlapping with given target node) | ||
// - "Ours": a list of every node in the overlapping set between the dependency tree of the root nodes and a given target node | ||
func (o *AnalyzeOptions) analyzeGraph(g *graph.MutableDirectedGraph) error { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
needs unit tests
Add some tests. Looks pretty good. |
d84f0e7
to
98802b7
Compare
@deads2k thanks, added tests |
98802b7
to
f389dcf
Compare
f389dcf
to
4792756
Compare
@deads2k addressed in-person feedback regarding Approving per our conversation. |
/test unit |
/test gcp |
@juanvallejo: you cannot LGTM your own PR. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
[APPROVALNOTIFIER] This PR is APPROVED Approval requirements bypassed by manually added approval. This pull-request has been approved by: juanvallejo The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
/test all [submit-queue is verifying that this PR is safe to merge] |
@juanvallejo: The following test failed, say
Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
Automatic merge from submit-queue (batch tested with PRs 18999, 18543). |
Depends on #18466
Depends on #18606
Adds dependency graph analysis.
Outputs "yours", "mine", "ours" dependencies.
Usage:
Output of the command above
cc @deads2k