1 、 Когда я добавляю новый хост в кластер, я встречал следующие ошибки. Как видите, в журнале ошибок нет!
2. Нет ошибок в ambari-server.log и ambari-agent.log
3 、 Я пытался добавить другие хосты, все они не удалось.
4 、 межсетевой экран 、 iptables 、 selinux все отключено. Я использую RHEL7.3 и Python2.7.5
5 、 Когда я добавляю эту физическую машину в кластер, который сервер Ambari установил на виртуальной машине, он может быть успешно зарегистрирован, но когда я добавляю его в кластер, который сервер Ambari установил на физическую машину, это конфигурация также, как показано ниже, я столкнулся с ошибкой.
6, физическая конфигурация машины
Процессор
Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Byte Order: Little Endian
CPU(s): 112
On-line CPU(s) list: 0-111
Thread(s) per core: 2
Core(s) per socket: 14
Socket(s): 4
NUMA node(s): 4
Vendor ID: GenuineIntel
CPU family: 6
Model: 79
Model name: Intel(R) Xeon(R) CPU E7-4830 v4 @ 2.00GHz
Stepping: 1
CPU MHz: 1995.140
BogoMIPS: 3996.78
Virtualization: VT-x
L1d cache: 32K
L1i cache: 32K
L2 cache: 256K
L3 cache: 35840K
NUMA node0 CPU(s): 0-13,56-69
NUMA node1 CPU(s): 14-27,70-83
NUMA node2 CPU(s): 28-41,84-97
NUMA node3 CPU(s): 42-55,98-111
Mem
free -g
total used free shared buff/cache available
Mem: 503 4 497 0 1 498
Swap: 31 0 31
7, конфигурация виртуальной машины:
CPU
Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Byte Order: Little Endian
CPU(s): 16
On-line CPU(s) list: 0-15
Thread(s) per core: 1
Core(s) per socket: 1
Socket(s): 16
NUMA node(s): 1
Vendor ID: GenuineIntel
CPU family: 6
Model: 42
Model name: Intel Xeon E312xx (Sandy Bridge)
Stepping: 1
CPU MHz: 2199.998
BogoMIPS: 4399.99
Virtualization: VT-x
Hypervisor vendor: KVM
Virtualization type: full
L1d cache: 32K
L1i cache: 32K
L2 cache: 4096K
L3 cache: 16384K
NUMA node0 CPU(s): 0-15
Пам
free -g
total used free shared buff/cache available
Mem: 31 6 9 0 14 23
Swap: 3 0 3
введите описание изображения здесь
Вот содержимое ambari-agent.log.
Кто-нибудь сталкивался с подобной ошибкой?
==========================
Creating target directory...
==========================
Command start time 2018-11-06 08:36:13
Connection to test-dldsj-datanode01 closed.
SSH command execution finished
host=test-dldsj-datanode01, exitcode=0
Command end time 2018-11-06 08:36:13
==========================
Copying ambari sudo script...
==========================
Command start time 2018-11-06 08:36:13
scp /var/lib/ambari-server/ambari-sudo.sh
host=test-dldsj-datanode01, exitcode=0
Command end time 2018-11-06 08:36:13
==========================
Copying common functions script...
==========================
Command start time 2018-11-06 08:36:13
scp /usr/lib/python2.6/site-packages/ambari_commons
host=test-dldsj-datanode01, exitcode=0
Command end time 2018-11-06 08:36:13
==========================
Copying OS type check script...
==========================
Command start time 2018-11-06 08:36:13
scp /usr/lib/python2.6/site-packages/ambari_server/os_check_type.py
host=test-dldsj-datanode01, exitcode=0
Command end time 2018-11-06 08:36:13
==========================
Running OS type check...
==========================
Command start time 2018-11-06 08:36:13
Cluster primary/cluster OS family is redhat7 and local/current OS family is redhat7
Connection to test-dldsj-datanode01 closed.
SSH command execution finished
host=test-dldsj-datanode01, exitcode=0
Command end time 2018-11-06 08:36:14
==========================
Checking 'sudo' package on remote host...
==========================
Command start time 2018-11-06 08:36:14
Connection to test-dldsj-datanode01 closed.
SSH command execution finished
host=test-dldsj-datanode01, exitcode=0
Command end time 2018-11-06 08:36:14
==========================
Copying repo file to 'tmp' folder...
==========================
Command start time 2018-11-06 08:36:14
scp /etc/yum.repos.d/ambari.repo
host=test-dldsj-datanode01, exitcode=0
Command end time 2018-11-06 08:36:14
==========================
Moving file to repo dir...
==========================
Command start time 2018-11-06 08:36:14
Connection to test-dldsj-datanode01 closed.
SSH command execution finished
host=test-dldsj-datanode01, exitcode=0
Command end time 2018-11-06 08:36:14
==========================
Changing permissions for ambari.repo...
==========================
Command start time 2018-11-06 08:36:14
Connection to test-dldsj-datanode01 closed.
SSH command execution finished
host=test-dldsj-datanode01, exitcode=0
Command end time 2018-11-06 08:36:14
==========================
Copying setup script file...
==========================
Command start time 2018-11-06 08:36:14
scp /usr/lib/python2.6/site-packages/ambari_server/setupAgent.py
host=test-dldsj-datanode01, exitcode=0
Command end time 2018-11-06 08:36:14
==========================
Running setup agent script...
==========================
Command start time 2018-11-06 08:36:14
('INFO 2018-11-05 17:03:53,137 DataCleaner.py:120 - Data cleanup started
INFO 2018-11-05 17:03:53,137 DataCleaner.py:122 - Data cleanup finished
INFO 2018-11-05 17:03:53,189 PingPortListener.py:50 - Ping port listener started on port: 8670
INFO 2018-11-05 17:03:53,191 main.py:349 - Connecting to Ambari server at https://test-dldsj-datanode01:8440 (10.212.155.20)
INFO 2018-11-05 17:03:53,191 NetUtil.py:62 - Connecting to https://test-dldsj-datanode01:8440/ca
INFO 2018-11-06 08:36:16,388 main.py:90 - loglevel=logging.INFO
INFO 2018-11-06 08:36:16,388 main.py:90 - loglevel=logging.INFO
INFO 2018-11-06 08:36:16,388 main.py:90 - loglevel=logging.INFO
INFO 2018-11-06 08:36:16,394 HeartbeatHandlers.py:83 - Ambari-agent received 15 signal, stopping...
INFO 2018-11-06 08:36:46,537 main.py:226 - Agent not going to die gracefully, going to execute kill -9
INFO 2018-11-06 08:36:46,553 ExitHelper.py:53 - Performing cleanup before exiting...
INFO 2018-11-06 08:36:46,905 main.py:90 - loglevel=logging.INFO
INFO 2018-11-06 08:36:46,905 main.py:90 - loglevel=logging.INFO
INFO 2018-11-06 08:36:46,905 main.py:90 - loglevel=logging.INFO
INFO 2018-11-06 08:36:46,906 DataCleaner.py:39 - Data cleanup thread started
INFO 2018-11-06 08:36:46,908 DataCleaner.py:120 - Data cleanup started
INFO 2018-11-06 08:36:46,909 DataCleaner.py:122 - Data cleanup finished
INFO 2018-11-06 08:36:46,961 PingPortListener.py:50 - Ping port listener started on port: 8670
INFO 2018-11-06 08:36:46,964 main.py:349 - Connecting to Ambari server at https://test-dldsj-datanode01:8440 (10.212.155.20)
INFO 2018-11-06 08:36:46,964 NetUtil.py:62 - Connecting to https://test-dldsj-datanode01:8440/ca
', None)
('INFO 2018-11-05 17:03:53,137 DataCleaner.py:120 - Data cleanup started
INFO 2018-11-05 17:03:53,137 DataCleaner.py:122 - Data cleanup finished
INFO 2018-11-05 17:03:53,189 PingPortListener.py:50 - Ping port listener started on port: 8670
INFO 2018-11-05 17:03:53,191 main.py:349 - Connecting to Ambari server at https://test-dldsj-datanode01:8440 (10.212.155.20)
INFO 2018-11-05 17:03:53,191 NetUtil.py:62 - Connecting to https://test-dldsj-datanode01:8440/ca
INFO 2018-11-06 08:36:16,388 main.py:90 - loglevel=logging.INFO
INFO 2018-11-06 08:36:16,388 main.py:90 - loglevel=logging.INFO
INFO 2018-11-06 08:36:16,388 main.py:90 - loglevel=logging.INFO
INFO 2018-11-06 08:36:16,394 HeartbeatHandlers.py:83 - Ambari-agent received 15 signal, stopping...
INFO 2018-11-06 08:36:46,537 main.py:226 - Agent not going to die gracefully, going to execute kill -9
INFO 2018-11-06 08:36:46,553 ExitHelper.py:53 - Performing cleanup before exiting...
INFO 2018-11-06 08:36:46,905 main.py:90 - loglevel=logging.INFO
INFO 2018-11-06 08:36:46,905 main.py:90 - loglevel=logging.INFO
INFO 2018-11-06 08:36:46,905 main.py:90 - loglevel=logging.INFO
INFO 2018-11-06 08:36:46,906 DataCleaner.py:39 - Data cleanup thread started
INFO 2018-11-06 08:36:46,908 DataCleaner.py:120 - Data cleanup started
INFO 2018-11-06 08:36:46,909 DataCleaner.py:122 - Data cleanup finished
INFO 2018-11-06 08:36:46,961 PingPortListener.py:50 - Ping port listener started on port: 8670
INFO 2018-11-06 08:36:46,964 main.py:349 - Connecting to Ambari server at https://test-dldsj-datanode01:8440 (10.212.155.20)
INFO 2018-11-06 08:36:46,964 NetUtil.py:62 - Connecting to https://test-dldsj-datanode01:8440/ca
', None)
Connection to test-dldsj-datanode01 closed.
SSH command execution finished
host=test-dldsj-datanode01, exitcode=0
Command end time 2018-11-06 08:36:49
Registering with the server...
Registration with the server failed.
Если у кого-то возникла такая же проблема, надеюсь, вы мне поможете. Большое спасибо!