Welcome to OStack Knowledge Sharing Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others

Categories

0 votes
289 views
in Technique[技术] by (71.8m points)

What is BigQuery ML Evaluation based on when DATA_SPLIT_METHOD = 'NO_SPLIT'

Context: creating (training) a model in BigQuery ML (logistic regression, to be more precise).

Problem: A default value for DATA_SPLIT_METHOD is 'AUTO_SPLIT'. In this case, I understand what results are presented in the Evaluation tab in the BigQuery UI. However, even with the value set to 'NO_SPLIT', I still get the Evaluation results, as indicated in the screenshot below:

enter image description here

Could anyone confirm whether this is the evaluation on the training set, or...? I'm not sure what else it could be.

Thanks in advance for any suggestions!


与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
Welcome To Ask or Share your Answers For Others

1 Answer

0 votes
by (71.8m points)

With NO_SPLIT, both training and the evaluation numbers are based on the entire training set.

Note that this means the evaluation number may be overly optimistic, as the model can just 'memorize' what it saw during training.


与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
Welcome to OStack Knowledge Sharing Community for programmer and developer-Open, Learning and Share
Click Here to Ask a Question

...