Tensorflow DNNclassifier:错误 wile 训练(numpy.ndarray 没有属性索引)

Tensorflow DNNclassifier: error wile training (numpy.ndarray has no attribute index)

提问人:Cybercop 提问时间:4/4/2018 最后编辑:MaximCybercop 更新时间:1/29/2019 访问量:1247

问:

我正在尝试在 tensorflow 中训练一个DNNClassifier

这是我的代码

train_input_fn = tf.estimator.inputs.pandas_input_fn(
    x=X_train,
    y=y_train,
    batch_size=1000,
    shuffle = True
)


nn_classifier = tf.estimator.DNNClassifier(hidden_units=[1300,1300,1300], feature_columns=X_train, n_classes=200)
nn_classifier.train(input_fn = train_input_fn,  steps=2000)

这是外观y_train

[450 450 450 ... 327 327 327]

类型:numpy.ndarray

这是看起来的样子X_train

[[ 9.79285  11.659035  1.279528 ...  1.258979  1.063923 -2.45522 ]
 [ 8.711333 13.92955   1.117603 ...  3.588921  1.231256 -3.180302]
 [ 5.159803 14.059619  1.740708 ...  0.28172  -0.506701 -1.326669]
 ...
 [ 2.418473  0.542642 -3.658447 ...  4.631474  4.544892 -4.595605]
 [ 6.51176   4.321688 -1.483697 ...  3.13299   5.476103 -2.833903]
 [ 6.894113  5.986267 -1.178247 ...  2.305603  7.217919 -2.152574]]

类型:numpy.ndarray

错误:

in pandas_input_fn(x, y, batch_size, num_epochs, shuffle, queue_capacity, num_threads, target_column)
     85           'Cannot use name %s for target column: DataFrame already has a '
     86           'column with that name: %s' % (target_column, x.columns))
---> 87     if not np.array_equal(x.index, y.index):
     88       raise ValueError('Index for x and y are mismatched.\nIndex for x: %s\n'
     89                        'Index for y: %s\n' % (x.index, y.index))

更新 1:使用 numpy_input_fn

train_input_fn= tf.estimator.inputs.numpy_input_fn(
    x=X_train,
    y=y_train,
    batch_size=1000,
    shuffle = True
)

错误:

INFO:tensorflow:Calling model_fn.

---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
<ipython-input-23-3b7c6b879e38> in <module>()
     10 start_time = time.time()
     11 nn_classifier = tf.estimator.DNNClassifier(hidden_units=[1300,1300,1300], feature_columns=X_train, n_classes=200)
---> 12 nn_classifier.train(input_fn = train_input_fn,  steps=2000)
     13 total_time = start_time - time.time()

c:\users\appdata\local\programs\python\python36\lib\site-packages\tensorflow\python\estimator\estimator.py in train(self, input_fn, hooks, steps, max_steps, saving_listeners)
    353 
    354     saving_listeners = _check_listeners_type(saving_listeners)
--> 355     loss = self._train_model(input_fn, hooks, saving_listeners)
    356     logging.info('Loss for final step: %s.', loss)
    357     return self

c:\users\appdata\local\programs\python\python36\lib\site-packages\tensorflow\python\estimator\estimator.py in _train_model(self, input_fn, hooks, saving_listeners)
    822       worker_hooks.extend(input_hooks)
    823       estimator_spec = self._call_model_fn(
--> 824           features, labels, model_fn_lib.ModeKeys.TRAIN, self.config)
    825 
    826       if self._warm_start_settings:

c:\users\appdata\local\programs\python\python36\lib\site-packages\tensorflow\python\estimator\estimator.py in _call_model_fn(self, features, labels, mode, config)
    803 
    804     logging.info('Calling model_fn.')
--> 805     model_fn_results = self._model_fn(features=features, **kwargs)
    806     logging.info('Done calling model_fn.')
    807 

c:\users\appdata\local\programs\python\python36\lib\site-packages\tensorflow\python\estimator\canned\dnn.py in _model_fn(features, labels, mode, config)
    347           head=head,
    348           hidden_units=hidden_units,
--> 349           feature_columns=tuple(feature_columns or []),
    350           optimizer=optimizer,
    351           activation_fn=activation_fn,

ValueError: The truth value of an array with more than one element is ambiguous. Use a.any() or a.all()

知道我做错了什么吗?

python pandas numpy tensorflow tensorflow-estimator

评论


答:

0赞 Maxim 4/19/2018 #1

问题出在估计器上的参数上。查看 tf.estimator.DNNClassifier 文档:feature_columns

feature_columns:包含模型使用的所有特征列的可迭代对象。集合中的所有项都应该是从 派生的类的实例。_FeatureColumn

文档中还有一个示例用法。您看起来像许多数字列,在这种情况下,您可以简单地创建一个如下所示的列表:X_train

feature_columns = [tf.feature_column.numeric_column(i) for i in range(...)]
0赞 vinn 1/29/2019 #2

我今天遇到了这个错误,并认为如果我证明一个解决方案会很棒。 问题是由 带来的。根据 TensorFlow 文档,必须是实例,并且必须是 OR 实例。该函数可以帮助确定 ur 和 值的数据类型。将 和 更改为适当的数据类型可以解决问题。tf.estimator.inputs.numpy_input_fnXpandas.DataFrameypandas.Seriespandas.DataFrametype()X_trainy_trainX_trainy_train