Add the `tensorboard.summary` public entry point #562

wchargin · 2017-09-23T03:53:49Z

Summary:
Resolves #561.

Test Plan:
Ran the build_pip_package.sh script to build the Python 2 package,
then created a new Python 2 virtualenv in which I installed only the
latest nightly TensorFlow release (2017-09-22) and the generated wheel.
Then wrote

import tensorflow as tf
from tensorboard import summary
result = tf.Session().run(summary.scalar('shelby', tf.constant(3)))
s = tf.Summary()
s.ParseFromString(result)
print(s)

and verified that the result was a proper scalar summary.

I then repeated the process using pip install tensorflow==1.3.0
instead of the nightly version to ensure that the module works for
normal TensorFlow users. (Note: the audio summary will not work because
TensorFlow 1.3 lacks 22730fd4c633a74e59c03ff76dc92e6ae2d5d020. This is
fine; users can continue to use the old API for that one.)

wchargin-branch: summary-module

chihuahua · 2017-09-24T02:18:01Z

Nice. Could we note in plugin docs to update tensorboard/summary.py when a summary's added?

wchargin · 2017-09-24T13:30:29Z

"note in plugin docs"—what plugin docs? We don't have a readme for tensorboard/plugins, nor do we need one; this is not public-facing, and it's not something that plugin developers need to know about.

It should be a rare thing for a summary to be added to TensorBoard core, and it may well be the case that we want to offer a plugin in tensorboard/plugins before with publish it in tensorboard.summary.

teamdandelion · 2017-09-25T18:44:49Z

Looks nice. Could the test verify that the _pb method is present too? Maybe with an exclusions set for those summaries that don't have a _pb method (although that should tend towards the empty set).

wchargin · 2017-09-25T19:15:26Z

Could the test verify that the _pb method is present too? Maybe with an exclusions set for those summaries that don't have a _pb method

Sure.

@chihuahua what's the status of writing pr_curve_pb?

jart

LGTM. Please wait for Dandelion to approve.

jart · 2017-09-25T21:04:40Z

tensorboard/summary_test.py

+      self.assertIsInstance(getattr(summary, plugin), collections.Callable)
+
+  def test_plugins_export_pb_functions(self):
+    for plugin in STANDARD_PLUGINS:


Our unit test best practices guide actually recommends against using things like for loops in unit tests. One notable exception is the Go language. But generally speaking, folks at Google err on the side of just having a lot of boilerplate to test each thing, so when the test runner tells you which method failed, you know the plugin for sure. You also wouldn't need to have special case logic for the pr_curve plugin.

But this is fine. I don't feel too strongly about it.

I never really understood that logic. I agree that the burden of being "clearly correct" is stronger for test code than main code, but this loop is clearly correct. (It looks like we have 130 for-loops in test code currently.) (Plus, why should the Go language be any different?)

I thought about setting a custom message, but the case in which this will fail will almost always be the getattr failing, not that the user exports something that is not a function, and in that case the name of the plugin that failed will appear in the error message. (It's very frustrating that a custom message replaces the default message instead of appending to it, or I'd use custom messages much more often.)

jart · 2017-09-25T21:05:56Z

tensorboard/summary.py

+scalar_pb = _scalar_summary.pb
+
+text = _text_summary.op
+text_pb = _text_summary.pb


This is a fantastic design. I like it a lot. You also followed good best practices for delegate imports.

Cool; thanks. I know that you had some reservations about doing this carefully, so I'm glad that you approve.

Out of curiosity…do you have a reference for these best practices for delegate imports? I just did what felt natural here, but I'd be interested to see what other people think.

wchargin

Will await Dandelion's approval. Thanks for reviewing!

wchargin · 2017-09-25T21:12:13Z

tensorboard/summary_test.py

+      self.assertIsInstance(getattr(summary, plugin), collections.Callable)
+
+  def test_plugins_export_pb_functions(self):
+    for plugin in STANDARD_PLUGINS:


I never really understood that logic. I agree that the burden of being "clearly correct" is stronger for test code than main code, but this loop is clearly correct. (It looks like we have 130 for-loops in test code currently.) (Plus, why should the Go language be any different?)

I thought about setting a custom message, but the case in which this will fail will almost always be the getattr failing, not that the user exports something that is not a function, and in that case the name of the plugin that failed will appear in the error message. (It's very frustrating that a custom message replaces the default message instead of appending to it, or I'd use custom messages much more often.)

wchargin · 2017-09-25T21:12:27Z

tensorboard/summary.py

+scalar_pb = _scalar_summary.pb
+
+text = _text_summary.op
+text_pb = _text_summary.pb


Cool; thanks. I know that you had some reservations about doing this carefully, so I'm glad that you approve.

jart · 2017-09-25T22:13:37Z

Here's how the best practices guide justifies it:

Tests should be straightforward enough to be 'obviously correct' upon inspection. Control structures (like loops and conditionals) can increase the chance for errors. When a test fails, having complex logic in the test makes it harder to diagnose the root cause of the failure.

It's usually impossible for production code to exhaustively list every output for each possible input, so we have to use the full power of our programming languages there. But test code doesn't need to cover every input - only a small set of representative values that mark the interesting cases in the code. Increasing the complexity of a single test by using conditionals or loops to cover multiple input types can quickly lead to logic errors in the tests themselves. In other words, tests should strive to be "above suspicion" when it comes to faulty behavior, lest they require tests of their own!

Tests written without control structures are clearer since the reader doesn't have to do any mental computations to understand them, and more likely to be correct since it's harder to have bugs in code without these constructs.

When tests do need their own logic, such logic should often be moved out of the test bodies and into utilities and helper functions. Since such helpers can get quite complex, it's usually a good idea for any nontrivial test utility to have its own tests.

Exception: It it okay to loop through an enumeration of values (e.g. an enum or a list of error codes) to perform a common set of assertions on them, often called data-driven testing. However, if you find yourself introducing conditionals to further optimize your data- driven tests (e.g. to apply different asserts to different inputs) you are better off creating separate test cases instead.

So it seems like you were correct in the beginning, until the pr_curve plugin broke your assumption and extra logic had to be added.

jart · 2017-09-25T22:20:46Z

Go gets to be special because it's Go. Without exceptions or macros, it's probably not possible to write unit tests the normal way. So they'll usually just have some complicated function that loops through an array of test vector objects.

teamdandelion

Looks good, modulo one small nit :) Thanks William!

teamdandelion · 2017-09-26T18:12:33Z

tensorboard/summary_test.py

+# graph. This set should ideally be empty; any entries here should be
+# considered temporary.
+PLUGINS_WITHOUT_PB_FUNCTIONS = frozenset([
+    'pr_curve',  # TODO(@chihuahua): Fix this.


Nit: Instead of making this a raw TODO, can we link it to a GitHub issue?

Added reference to #445, which already exists and is assigned to @chihuahua . Will merge once Travis finishes.

Summary: Resolves #561. Test Plan: Ran the `build_pip_package.sh` script to build the Python 2 package, then created a new Python 2 virtualenv in which I installed only the latest nightly TensorFlow release (2017-09-22) and the generated wheel. Then wrote ```py import tensorflow as tf from tensorboard import summary result = tf.Session().run(summary.scalar('shelby', tf.constant(3))) s = tf.Summary() s.ParseFromString(result) print(s) ``` and verified that the result was a proper scalar summary. I then repeated the process using `pip install tensorflow==1.3.0` instead of the nightly version to ensure that the module works for normal TensorFlow users. (Note: the audio summary will not work because TensorFlow 1.3 lacks 22730fd4c633a74e59c03ff76dc92e6ae2d5d020. This is fine; users can continue to use the old API for that one.) wchargin-branch: summary-module

Summary: Resolves tensorflow#561. Test Plan: Ran the `build_pip_package.sh` script to build the Python 2 package, then created a new Python 2 virtualenv in which I installed only the latest nightly TensorFlow release (2017-09-22) and the generated wheel. Then wrote ```py import tensorflow as tf from tensorboard import summary result = tf.Session().run(summary.scalar('shelby', tf.constant(3))) s = tf.Summary() s.ParseFromString(result) print(s) ``` and verified that the result was a proper scalar summary. I then repeated the process using `pip install tensorflow==1.3.0` instead of the nightly version to ensure that the module works for normal TensorFlow users. (Note: the audio summary will not work because TensorFlow 1.3 lacks 22730fd4c633a74e59c03ff76dc92e6ae2d5d020. This is fine; users can continue to use the old API for that one.) wchargin-branch: summary-module

Summary: Resolves #561. Test Plan: Ran the `build_pip_package.sh` script to build the Python 2 package, then created a new Python 2 virtualenv in which I installed only the latest nightly TensorFlow release (2017-09-22) and the generated wheel. Then wrote ```py import tensorflow as tf from tensorboard import summary result = tf.Session().run(summary.scalar('shelby', tf.constant(3))) s = tf.Summary() s.ParseFromString(result) print(s) ``` and verified that the result was a proper scalar summary. I then repeated the process using `pip install tensorflow==1.3.0` instead of the nightly version to ensure that the module works for normal TensorFlow users. (Note: the audio summary will not work because TensorFlow 1.3 lacks 22730fd4c633a74e59c03ff76dc92e6ae2d5d020. This is fine; users can continue to use the old API for that one.) wchargin-branch: summary-module

wchargin requested review from jart and teamdandelion September 23, 2017 03:53

wchargin mentioned this pull request Sep 23, 2017

Cherry picks for TensorBoard 0.1.7 #560

Merged

chihuahua added the core:summaries label Sep 24, 2017

wchargin force-pushed the wchargin-summary-module branch from f1f0712 to c316b15 Compare September 25, 2017 19:22

jart approved these changes Sep 25, 2017

View reviewed changes

wchargin commented Sep 25, 2017

View reviewed changes

teamdandelion approved these changes Sep 26, 2017

View reviewed changes

wchargin force-pushed the wchargin-summary-module branch from c316b15 to 4808b57 Compare September 26, 2017 18:22

wchargin force-pushed the wchargin-summary-module branch from 4808b57 to 4db1c24 Compare September 26, 2017 18:31

wchargin merged commit ab6ffbb into master Sep 26, 2017

jart deleted the wchargin-summary-module branch March 27, 2018 23:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add the `tensorboard.summary` public entry point #562

Add the `tensorboard.summary` public entry point #562

wchargin commented Sep 23, 2017

chihuahua commented Sep 24, 2017

wchargin commented Sep 24, 2017

teamdandelion commented Sep 25, 2017

wchargin commented Sep 25, 2017

jart left a comment

jart Sep 25, 2017

wchargin Sep 25, 2017

jart Sep 25, 2017

wchargin Sep 25, 2017

wchargin Sep 26, 2017

wchargin left a comment

wchargin Sep 25, 2017

wchargin Sep 25, 2017

jart commented Sep 25, 2017

jart commented Sep 25, 2017

teamdandelion left a comment

teamdandelion Sep 26, 2017

wchargin Sep 26, 2017

wchargin Sep 26, 2017

Add the tensorboard.summary public entry point #562

Add the tensorboard.summary public entry point #562

Conversation

wchargin commented Sep 23, 2017

chihuahua commented Sep 24, 2017

wchargin commented Sep 24, 2017

teamdandelion commented Sep 25, 2017

wchargin commented Sep 25, 2017

jart left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

wchargin left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jart commented Sep 25, 2017

jart commented Sep 25, 2017

teamdandelion left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Add the `tensorboard.summary` public entry point #562

Add the `tensorboard.summary` public entry point #562