Регистрация агента Ambari не удалась и ошибок не было найдено! Я использую RHEL7.3 и Python2.7.5 - PullRequest
0 голосов
/ 07 ноября 2018

1 、 Когда я добавляю новый хост в кластер, я встречал следующие ошибки. Как видите, в журнале ошибок нет! 2. Нет ошибок в ambari-server.log и ambari-agent.log

3 、 Я пытался добавить другие хосты, все они не удалось.

4 、 межсетевой экран 、 iptables 、 selinux все отключено. Я использую RHEL7.3 и Python2.7.5

5 、 Когда я добавляю эту физическую машину в кластер, который сервер Ambari установил на виртуальной машине, он может быть успешно зарегистрирован, но когда я добавляю его в кластер, который сервер Ambari установил на физическую машину, это конфигурация также, как показано ниже, я столкнулся с ошибкой.

6, физическая конфигурация машины
Процессор

Architecture:          x86_64
CPU op-mode(s):        32-bit, 64-bit
Byte Order:            Little Endian
CPU(s):                112
On-line CPU(s) list:   0-111
Thread(s) per core:    2
Core(s) per socket:    14
Socket(s):             4
NUMA node(s):          4
Vendor ID:             GenuineIntel
CPU family:            6
Model:                 79
Model name:            Intel(R) Xeon(R) CPU E7-4830 v4 @ 2.00GHz
Stepping:              1
CPU MHz:               1995.140
BogoMIPS:              3996.78
Virtualization:        VT-x
L1d cache:             32K
L1i cache:             32K
L2 cache:              256K
L3 cache:              35840K
NUMA node0 CPU(s):     0-13,56-69
NUMA node1 CPU(s):     14-27,70-83
NUMA node2 CPU(s):     28-41,84-97
NUMA node3 CPU(s):     42-55,98-111

Mem

free -g
              total        used        free      shared  buff/cache   available
Mem:            503           4         497           0           1         498
Swap:            31           0          31

7, конфигурация виртуальной машины:
CPU

Architecture:          x86_64
CPU op-mode(s):        32-bit, 64-bit
Byte Order:            Little Endian
CPU(s):                16
On-line CPU(s) list:   0-15
Thread(s) per core:    1
Core(s) per socket:    1
Socket(s):             16
NUMA node(s):          1
Vendor ID:             GenuineIntel
CPU family:            6
Model:                 42
Model name:            Intel Xeon E312xx (Sandy Bridge)
Stepping:              1
CPU MHz:               2199.998
BogoMIPS:              4399.99
Virtualization:        VT-x
Hypervisor vendor:     KVM
Virtualization type:   full
L1d cache:             32K
L1i cache:             32K
L2 cache:              4096K
L3 cache:              16384K
NUMA node0 CPU(s):     0-15

Пам

free -g
              total        used        free      shared  buff/cache   available
Mem:             31           6           9           0          14          23
Swap:             3           0           3

введите описание изображения здесь

Вот содержимое ambari-agent.log. Кто-нибудь сталкивался с подобной ошибкой?

==========================
Creating target directory...
==========================

Command start time 2018-11-06 08:36:13

Connection to test-dldsj-datanode01 closed.
SSH command execution finished
host=test-dldsj-datanode01, exitcode=0
Command end time 2018-11-06 08:36:13

==========================
Copying ambari sudo script...
==========================

Command start time 2018-11-06 08:36:13

scp /var/lib/ambari-server/ambari-sudo.sh
host=test-dldsj-datanode01, exitcode=0
Command end time 2018-11-06 08:36:13

==========================
Copying common functions script...
==========================

Command start time 2018-11-06 08:36:13

scp /usr/lib/python2.6/site-packages/ambari_commons
host=test-dldsj-datanode01, exitcode=0
Command end time 2018-11-06 08:36:13

==========================
Copying OS type check script...
==========================

Command start time 2018-11-06 08:36:13

scp /usr/lib/python2.6/site-packages/ambari_server/os_check_type.py
host=test-dldsj-datanode01, exitcode=0
Command end time 2018-11-06 08:36:13

==========================
Running OS type check...
==========================

Command start time 2018-11-06 08:36:13
Cluster primary/cluster OS family is redhat7 and local/current OS family is redhat7

Connection to test-dldsj-datanode01 closed.
SSH command execution finished
host=test-dldsj-datanode01, exitcode=0
Command end time 2018-11-06 08:36:14

==========================
Checking 'sudo' package on remote host...
==========================

Command start time 2018-11-06 08:36:14

Connection to test-dldsj-datanode01 closed.
SSH command execution finished
host=test-dldsj-datanode01, exitcode=0
Command end time 2018-11-06 08:36:14

==========================
Copying repo file to 'tmp' folder...
==========================

Command start time 2018-11-06 08:36:14

scp /etc/yum.repos.d/ambari.repo
host=test-dldsj-datanode01, exitcode=0
Command end time 2018-11-06 08:36:14

==========================
Moving file to repo dir...
==========================

Command start time 2018-11-06 08:36:14

Connection to test-dldsj-datanode01 closed.
SSH command execution finished
host=test-dldsj-datanode01, exitcode=0
Command end time 2018-11-06 08:36:14

==========================
Changing permissions for ambari.repo...
==========================

Command start time 2018-11-06 08:36:14

Connection to test-dldsj-datanode01 closed.
SSH command execution finished
host=test-dldsj-datanode01, exitcode=0
Command end time 2018-11-06 08:36:14

==========================
Copying setup script file...
==========================

Command start time 2018-11-06 08:36:14

scp /usr/lib/python2.6/site-packages/ambari_server/setupAgent.py
host=test-dldsj-datanode01, exitcode=0
Command end time 2018-11-06 08:36:14

==========================
Running setup agent script...
==========================

Command start time 2018-11-06 08:36:14
('INFO 2018-11-05 17:03:53,137 DataCleaner.py:120 - Data cleanup started
INFO 2018-11-05 17:03:53,137 DataCleaner.py:122 - Data cleanup finished
INFO 2018-11-05 17:03:53,189 PingPortListener.py:50 - Ping port listener started on port: 8670
INFO 2018-11-05 17:03:53,191 main.py:349 - Connecting to Ambari server at https://test-dldsj-datanode01:8440 (10.212.155.20)
INFO 2018-11-05 17:03:53,191 NetUtil.py:62 - Connecting to https://test-dldsj-datanode01:8440/ca
INFO 2018-11-06 08:36:16,388 main.py:90 - loglevel=logging.INFO
INFO 2018-11-06 08:36:16,388 main.py:90 - loglevel=logging.INFO
INFO 2018-11-06 08:36:16,388 main.py:90 - loglevel=logging.INFO
INFO 2018-11-06 08:36:16,394 HeartbeatHandlers.py:83 - Ambari-agent received 15 signal, stopping...
INFO 2018-11-06 08:36:46,537 main.py:226 - Agent not going to die gracefully, going to execute kill -9
INFO 2018-11-06 08:36:46,553 ExitHelper.py:53 - Performing cleanup before exiting...
INFO 2018-11-06 08:36:46,905 main.py:90 - loglevel=logging.INFO
INFO 2018-11-06 08:36:46,905 main.py:90 - loglevel=logging.INFO
INFO 2018-11-06 08:36:46,905 main.py:90 - loglevel=logging.INFO
INFO 2018-11-06 08:36:46,906 DataCleaner.py:39 - Data cleanup thread started
INFO 2018-11-06 08:36:46,908 DataCleaner.py:120 - Data cleanup started
INFO 2018-11-06 08:36:46,909 DataCleaner.py:122 - Data cleanup finished
INFO 2018-11-06 08:36:46,961 PingPortListener.py:50 - Ping port listener started on port: 8670
INFO 2018-11-06 08:36:46,964 main.py:349 - Connecting to Ambari server at https://test-dldsj-datanode01:8440 (10.212.155.20)
INFO 2018-11-06 08:36:46,964 NetUtil.py:62 - Connecting to https://test-dldsj-datanode01:8440/ca
', None)
('INFO 2018-11-05 17:03:53,137 DataCleaner.py:120 - Data cleanup started
INFO 2018-11-05 17:03:53,137 DataCleaner.py:122 - Data cleanup finished
INFO 2018-11-05 17:03:53,189 PingPortListener.py:50 - Ping port listener started on port: 8670
INFO 2018-11-05 17:03:53,191 main.py:349 - Connecting to Ambari server at https://test-dldsj-datanode01:8440 (10.212.155.20)
INFO 2018-11-05 17:03:53,191 NetUtil.py:62 - Connecting to https://test-dldsj-datanode01:8440/ca
INFO 2018-11-06 08:36:16,388 main.py:90 - loglevel=logging.INFO
INFO 2018-11-06 08:36:16,388 main.py:90 - loglevel=logging.INFO
INFO 2018-11-06 08:36:16,388 main.py:90 - loglevel=logging.INFO
INFO 2018-11-06 08:36:16,394 HeartbeatHandlers.py:83 - Ambari-agent received 15 signal, stopping...
INFO 2018-11-06 08:36:46,537 main.py:226 - Agent not going to die gracefully, going to execute kill -9
INFO 2018-11-06 08:36:46,553 ExitHelper.py:53 - Performing cleanup before exiting...
INFO 2018-11-06 08:36:46,905 main.py:90 - loglevel=logging.INFO
INFO 2018-11-06 08:36:46,905 main.py:90 - loglevel=logging.INFO
INFO 2018-11-06 08:36:46,905 main.py:90 - loglevel=logging.INFO
INFO 2018-11-06 08:36:46,906 DataCleaner.py:39 - Data cleanup thread started
INFO 2018-11-06 08:36:46,908 DataCleaner.py:120 - Data cleanup started
INFO 2018-11-06 08:36:46,909 DataCleaner.py:122 - Data cleanup finished
INFO 2018-11-06 08:36:46,961 PingPortListener.py:50 - Ping port listener started on port: 8670
INFO 2018-11-06 08:36:46,964 main.py:349 - Connecting to Ambari server at https://test-dldsj-datanode01:8440 (10.212.155.20)
INFO 2018-11-06 08:36:46,964 NetUtil.py:62 - Connecting to https://test-dldsj-datanode01:8440/ca
', None)

Connection to test-dldsj-datanode01 closed.
SSH command execution finished
host=test-dldsj-datanode01, exitcode=0
Command end time 2018-11-06 08:36:49

Registering with the server...
Registration with the server failed.

Если у кого-то возникла такая же проблема, надеюсь, вы мне поможете. Большое спасибо!

...