Adding some more documentation about tests (aa10d32c) · Commits · 钟慕尧 / deepchem

deepchem/models/tests/test_reload.py

+3 −0

Original line number	Diff line number	Diff line
		@@ -532,6 +532,7 @@ def test_weave_classification_reload():
		assert scores[classification_metric.name] > .9


		# TODO: THIS IS FAILING!
		def test_MPNN_regression_reload():
		"""Test MPNN can reload datasets."""
		np.random.seed(123)
		@@ -600,6 +601,7 @@ def test_MPNN_regression_reload():
		assert scores[regression_metric.name] > .8


		# TODO: THIS IS FAILING!
		def test_textCNN_classification_reload():
		"""Test textCNN model reloadinng."""
		np.random.seed(123)
		@@ -717,6 +719,7 @@ def test_1d_cnn_regression_reload():
		assert scores[regression_metric.name] < 0.1


		# TODO: THIS IS FAILING!
		def test_graphconvmodel_reload():
		featurizer = dc.feat.ConvMolFeaturizer()
		tasks = ["outcome"]

+18 −0

Original line number	Diff line number	Diff line
		@@ -63,6 +63,24 @@ that break them may sometimes slip through and get merged into the repository.
		We still try to run them regularly, so hopefully the problem will be discovered
		fairly soon.

		Testing Machine Learning Models
		-------------------------------

		Testing the correctness of a machine learning model can be quite tricky to do
		in practice. When adding a new machine learning model to DeepChem, you should
		add at least a few basic types of unit tests:

		- Overfitting test: Create a small synthetic dataset and test that your model
		can learn this datasest with high accuracy. For regression and classification
		task, this should correspond to low training error on the dataset. For
		generative tasks, this should correspond to low loss on the dataset.
		- Reloading test: Check that a trained model can be saved to disk and reloaded
		correctly. This should involve predictions from the saved and reloaded models
		matching exactly.

		Note that unit tests are not sufficient to gauge the real performance of a
		model. You should benchmark your model on larger datasets as well and report
		your benchmarking tests in the PR comments.

		Type Annotations
		----------------