SageMaker: ImportError: No module named 'pandas' when running the training script
April 29, 2020

I am currently running this exact notebook: https://github.com/udacity/sagemaker-deployment/blob/master/Project/SageMaker%20Project.ipynb

The only difference is that I use a different dataset (which I converted to the same format as in the notebook).

When I call the estimator's fit method, I get the following error:

2020-04-29 17:18:03 Starting - Starting the training job...
2020-04-29 17:18:06 Starting - Launching requested ML instances...
2020-04-29 17:19:03 Starting - Preparing the instances for training......
2020-04-29 17:19:54 Downloading - Downloading input data
2020-04-29 17:19:54 Training - Downloading the training image.bash: cannot set terminal process group (-1): Inappropriate ioctl for device
bash: no job control in this shell



2020-04-29 17:20:13,936 sagemaker-containers ERROR    ExecuteUserScriptError:
Command "/usr/bin/python -m train --epochs 10 --hidden_dim 200"
Traceback (most recent call last):
  File "/usr/lib/python3.5/runpy.py", line 184, in _run_module_as_main
    "__main__", mod_spec)
  File "/usr/lib/python3.5/runpy.py", line 85, in _run_code
    exec(code, run_globals)
  File "/opt/ml/code/train.py", line 11, in <module>
    import pandas as pd
ImportError: No module named 'pandas'

2020-04-29 17:20:25 Uploading - Uploading generated training model
2020-04-29 17:20:25 Failed - Training job failed
---------------------------------------------------------------------------
UnexpectedStatusException                 Traceback (most recent call last)
<ipython-input-37-0e8223086435> in <module>()
----> 1 estimator.fit({'training': input_data})

~/anaconda3/envs/pytorch_p36/lib/python3.6/site-packages/sagemaker/estimator.py in fit(self, inputs, wait, logs, job_name, experiment_config)
    475         self.jobs.append(self.latest_training_job)
    476         if wait:
--> 477             self.latest_training_job.wait(logs=logs)
    478 
    479     def _compilation_job_name(self):

~/anaconda3/envs/pytorch_p36/lib/python3.6/site-packages/sagemaker/estimator.py in wait(self, logs)
   1084         # If logs are requested, call logs_for_jobs.
   1085         if logs != "None":
-> 1086             self.sagemaker_session.logs_for_job(self.job_name, wait=True, log_type=logs)
   1087         else:
   1088             self.sagemaker_session.wait_for_job(self.job_name)

~/anaconda3/envs/pytorch_p36/lib/python3.6/site-packages/sagemaker/session.py in logs_for_job(self, job_name, wait, poll, log_type)
   3042 
   3043         if wait:
-> 3044             self._check_job_status(job_name, description, "TrainingJobStatus")
   3045             if dot:
   3046                 print()

~/anaconda3/envs/pytorch_p36/lib/python3.6/site-packages/sagemaker/session.py in _check_job_status(self, job, desc, status_key_name)
   2636                 ),
   2637                 allowed_statuses=["Completed", "Stopped"],
-> 2638                 actual_status=status,
   2639             )
   2640 

UnexpectedStatusException: Error for Training job sagemaker-pytorch-2020-04-29-17-18-03-379: Failed. Reason: AlgorithmError: ExecuteUserScriptError:
Command "/usr/bin/python -m train --epochs 10 --hidden_dim 200"
Traceback (most recent call last):
  File "/usr/lib/python3.5/runpy.py", line 184, in _run_module_as_main
    "__main__", mod_spec)
  File "/usr/lib/python3.5/runpy.py", line 85, in _run_code
    exec(code, run_globals)
  File "/opt/ml/code/train.py", line 11, in <module>
    import pandas as pd
ImportError: No module named 'pandas'

Does anyone know what I can do to fix this problem?
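
A note on what the logs show: the notebook runs in the pytorch_p36 environment (Python 3.6), while train.py is executed inside the training container by /usr/bin/python with Python 3.5. Having pandas installed in the notebook environment therefore says nothing about the container. One possible workaround, offered only as a minimal sketch, is to install the missing package at the top of train.py before importing it. The helper name `ensure_package` is hypothetical, and the sketch assumes that pip is available in the container and that the training instance can reach a package index; neither is confirmed by the logs above.

```python
import importlib
import subprocess
import sys


def ensure_package(module_name, pip_name=None):
    """Import module_name, installing it with pip first if it is missing.

    Hypothetical helper: assumes pip and network access exist in the
    environment where the training script runs.
    """
    try:
        return importlib.import_module(module_name)
    except ImportError:
        # Use the same interpreter that runs this script, so the package
        # lands in the environment that will actually import it.
        subprocess.check_call(
            [sys.executable, "-m", "pip", "install", pip_name or module_name]
        )
        # Make the freshly installed package visible to the import system.
        importlib.invalidate_caches()
        return importlib.import_module(module_name)
```

In train.py this would replace the failing line with something like `pd = ensure_package("pandas")`. Another commonly used route is to list the package in a requirements.txt placed in the directory passed to the estimator as its source directory; whether the container image used in this job picks that file up is an assumption that should be checked against the SageMaker documentation for the framework version in use.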

...