Кластер MariaDB: узел объединения Galera mariaDB не может присоединиться к кластеру при перезапуске - PullRequest
0 голосов
/ 31 марта 2020

Когда я перезагружаю сервер mysql в кластере galera, он показывает ошибки и останавливается, запускается непрерывно.

Кластер имеет 3 узла. и один узел так занят для трафика c. поэтому я перезагружаю узел временно. Но это не может подойти.

Я пытался обновить my.cnf и grastate.dat и так одно. Но все равно выкладывается ошибка.

Журнал ниже

2020-03-31 16:26:04 139748981156096 [Note] /usr/sbin/mysqld (mysqld 10.1.29-MariaDB-1~xenial) starting as process 4326 ...
2020-03-31 16:26:04 139748981156096 [Note] WSREP: Read nil XID from storage engines, skipping position init
2020-03-31 16:26:04 139748981156096 [Note] WSREP: wsrep_load(): loading provider library '/usr/lib/libgalera_smm.so'
2020-03-31 16:26:04 139748981156096 [Note] WSREP: wsrep_load(): Galera 25.3.22(r3764) by Codership Oy <info@codership.com> loaded successfully.
2020-03-31 16:26:04 139748981156096 [Note] WSREP: CRC-32C: using hardware acceleration.
2020-03-31 16:26:04 139748981156096 [Note] WSREP: Found saved state: 00000000-0000-0000-0000-000000000000:-1, safe_to_bootstrap: 1
2020-03-31 16:26:04 139748981156096 [Note] WSREP: Passing config to GCS: base_dir = /data/db/; base_host = 10.10.10.21; base_port = 4567; cert.log_conflicts = no; debug = no; evs.auto_evict = 0; evs.delay_margin = PT1S; evs.delayed_keep_period = PT30S; evs.inactive_check_period = PT0.5S; evs.inactive_timeout = PT15S; evs.join_retrans_period = PT1S; evs.max_install_timeouts = 3; evs.send_window = 4; evs.stats_report_period = PT1M; evs.suspect_timeout = PT5S; evs.user_send_window = 2; evs.view_forget_timeout = PT24H; gcache.dir = /data/db/; gcache.keep_pages_size = 0; gcache.mem_size = 0; gcache.name = /data/db//galera.cache; gcache.page_size = 128M; gcache.recover = no; gcache.size = 128M; gcomm.thread_prio = ; gcs.fc_debug = 0; gcs.fc_factor = 1.0; gcs.fc_limit = 16; gcs.fc_master_slave = no; gcs.max_packet_size = 64500; gcs.max_throttle = 0.25; gcs.recv_q_hard_limit = 9223372036854775807; gcs.recv_q_soft_limit = 0.25; gcs.sync_donor = no; gmcast.segment = 0; gmcast.version = 0; pc.announce_timeout = PT3S; pc.checksum = false; pc.ignore_quorum = fals
2020-03-31 16:26:04 139748981156096 [Note] WSREP: GCache history reset: 186fc65c-fcd7-11e7-a48c-e72eabf327ae:0 -> 00000000-0000-0000-0000-000000000000:-1
2020-03-31 16:26:04 139748981156096 [Note] WSREP: Assign initial position for certification: -1, protocol version: -1
2020-03-31 16:26:04 139748981156096 [Note] WSREP: wsrep_sst_grab()
2020-03-31 16:26:04 139748981156096 [Note] WSREP: Start replication
2020-03-31 16:26:04 139748981156096 [Note] WSREP: Setting initial position to 00000000-0000-0000-0000-000000000000:-1
2020-03-31 16:26:04 139748981156096 [Note] WSREP: protonet asio version 0
2020-03-31 16:26:04 139748981156096 [Note] WSREP: Using CRC-32C for message checksums.
2020-03-31 16:26:04 139748981156096 [Note] WSREP: backend: asio
2020-03-31 16:26:04 139748981156096 [Note] WSREP: gcomm thread scheduling priority set to other:0
2020-03-31 16:26:04 139748981156096 [Warning] WSREP: access file(/data/db//gvwstate.dat) failed(No such file or directory)
2020-03-31 16:26:04 139748981156096 [Note] WSREP: restore pc from disk failed
2020-03-31 16:26:04 139748981156096 [Note] WSREP: GMCast version 0
2020-03-31 16:26:04 139748981156096 [Note] WSREP: (e161ffcf, 'tcp://0.0.0.0:4567') listening at tcp://0.0.0.0:4567
2020-03-31 16:26:04 139748981156096 [Note] WSREP: (e161ffcf, 'tcp://0.0.0.0:4567') multicast: , ttl: 1
2020-03-31 16:26:04 139748981156096 [Note] WSREP: EVS version 0
2020-03-31 16:26:04 139748981156096 [Note] WSREP: gcomm: connecting to group 'Albatross', peer '10.10.10.21:4567,10.10.10.31:4567,10.10.10.32:4567'
2020-03-31 16:26:04 139748981156096 [Note] WSREP: (e161ffcf, 'tcp://0.0.0.0:4567') connection established to e161ffcf tcp://10.10.10.21:4567
2020-03-31 16:26:04 139748981156096 [Warning] WSREP: (e161ffcf, 'tcp://0.0.0.0:4567') address 'tcp://10.10.10.21:4567' points to own listening address, blacklisting
2020-03-31 16:26:04 139748981156096 [Note] WSREP: (e161ffcf, 'tcp://0.0.0.0:4567') connection established to 4f71fc45 tcp://10.10.10.32:4567
2020-03-31 16:26:04 139748981156096 [Note] WSREP: (e161ffcf, 'tcp://0.0.0.0:4567') turning message relay requesting on, nonlive peers:
2020-03-31 16:26:04 139748981156096 [Note] WSREP: (e161ffcf, 'tcp://0.0.0.0:4567') connection established to 9086ffc0 tcp://10.10.10.31:4567
2020-03-31 16:26:05 139748981156096 [Note] WSREP: declaring 4f71fc45 at tcp://10.10.10.32:4567 stable
2020-03-31 16:26:05 139748981156096 [Note] WSREP: declaring 9086ffc0 at tcp://10.10.10.31:4567 stable
2020-03-31 16:26:05 139748981156096 [Note] WSREP: Node 4f71fc45 state prim
2020-03-31 16:26:05 139748981156096 [Note] WSREP: view(view_id(PRIM,4f71fc45,1391) memb {
        4f71fc45,0
        9086ffc0,0
        e161ffcf,0
} joined {
} left {
} partitioned {
})
2020-03-31 16:26:05 139748981156096 [Note] WSREP: save pc into disk
2020-03-31 16:26:05 139748981156096 [Note] WSREP: gcomm: connected
2020-03-31 16:26:05 139748981156096 [Note] WSREP: Changing maximum packet size to 64500, resulting msg size: 32636
2020-03-31 16:26:05 139748981156096 [Note] WSREP: Shifting CLOSED -> OPEN (TO: 0)
2020-03-31 16:26:05 139748981156096 [Note] WSREP: Opened channel 'Albatross'
2020-03-31 16:26:05 139748626654976 [Note] WSREP: New COMPONENT: primary = yes, bootstrap = no, my_idx = 2, memb_num = 3
2020-03-31 16:26:05 139748626654976 [Note] WSREP: STATE EXCHANGE: Waiting for state UUID.
2020-03-31 16:26:05 139748626654976 [Note] WSREP: STATE EXCHANGE: sent state msg: e1a9f00e-7320-11ea-957b-fb8d9c83f3a4
2020-03-31 16:26:05 139748981156096 [Note] WSREP: Waiting for SST to complete.
2020-03-31 16:26:05 139748626654976 [Note] WSREP: STATE EXCHANGE: got state msg: e1a9f00e-7320-11ea-957b-fb8d9c83f3a4 from 0 (Albatross03)
2020-03-31 16:26:05 139748626654976 [Note] WSREP: STATE EXCHANGE: got state msg: e1a9f00e-7320-11ea-957b-fb8d9c83f3a4 from 1 (Albatross02)
2020-03-31 16:26:05 139748626654976 [Note] WSREP: STATE EXCHANGE: got state msg: e1a9f00e-7320-11ea-957b-fb8d9c83f3a4 from 2 (Albatross01)
2020-03-31 16:26:05 139748626654976 [Note] WSREP: Quorum results:
        version    = 4,
        component  = PRIMARY,
        conf_id    = 1386,
        members    = 2/3 (joined/total),
        act_id     = 36013067,
        last_appl. = -1,
        protocols  = 0/7/3 (gcs/repl/appl),
        group UUID = 186fc65c-fcd7-11e7-a48c-e72eabf327ae
2020-03-31 16:26:05 139748626654976 [Note] WSREP: Flow-control interval: [28, 28]
2020-03-31 16:26:05 139748626654976 [Note] WSREP: Trying to continue unpaused monitor
2020-03-31 16:26:05 139748626654976 [Note] WSREP: Shifting OPEN -> PRIMARY (TO: 36013067)
2020-03-31 16:26:05 139748980841216 [Note] WSREP: State transfer required:
        Group state: 186fc65c-fcd7-11e7-a48c-e72eabf327ae:36013067
        Local state: 00000000-0000-0000-0000-000000000000:-1
2020-03-31 16:26:05 139748980841216 [Note] WSREP: New cluster view: global state: 186fc65c-fcd7-11e7-a48c-e72eabf327ae:36013067, view# 1387: Primary, number of nodes: 3, my index: 2, protocol version 3
2020-03-31 16:26:05 139748980841216 [Warning] WSREP: Gap in state sequence. Need state transfer.
2020-03-31 16:26:05 139748597298944 [Note] WSREP: Running: 'wsrep_sst_xtrabackup --role 'joiner' --address '10.10.10.21:4569' --datadir '/data/db/'   --parent '4326' --binlog '/var/log/mysql/mariadb-bin' '
WSREP_SST: [INFO] Streaming with tar (20200331 16:26:05.174)
WSREP_SST: [INFO] Using socat as streamer (20200331 16:26:05.176)
WSREP_SST: [INFO] Evaluating socat -u TCP-LISTEN:4569,reuseaddr stdio | tar xfi - --recursive-unlink -h; RC=( ${PIPESTATUS[@]} ) (20200331 16:26:05.186)
2020-03-31 16:26:05 139748980841216 [Note] WSREP: Prepared SST request: xtrabackup|10.10.10.21:4569/xtrabackup_sst
2020-03-31 16:26:05 139748980841216 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
2020-03-31 16:26:05 139748980841216 [Note] WSREP: REPL Protocols: 7 (3, 2)
2020-03-31 16:26:05 139748980841216 [Note] WSREP: Assign initial position for certification: 36013067, protocol version: 3
2020-03-31 16:26:05 139748684601088 [Note] WSREP: Service thread queue flushed.
2020-03-31 16:26:05 139748980841216 [Warning] WSREP: Failed to prepare for incremental state transfer: Local state UUID (00000000-0000-0000-0000-000000000000) does not match group state UUID (186fc65c-fcd7-11e7-a48c-e72eabf327ae): 1 (Operation not permitted)
         at galera/src/replicator_str.cpp:prepare_for_IST():482. IST will be unavailable.
2020-03-31 16:26:05 139748626654976 [Note] WSREP: Member 2.0 (Albatross01) requested state transfer from '*any*'. Selected 0.0 (Albatross03)(SYNCED) as donor.
2020-03-31 16:26:05 139748626654976 [Note] WSREP: Shifting PRIMARY -> JOINER (TO: 36013067)
2020-03-31 16:26:05 139748980841216 [Note] WSREP: Requesting state transfer: success, donor: 0
2020-03-31 16:26:05 139748980841216 [Note] WSREP: GCache history reset: 00000000-0000-0000-0000-000000000000:0 -> 186fc65c-fcd7-11e7-a48c-e72eabf327ae:36013067
2020/03/31 16:26:05 socat[4489] E bind(6, {AF=2 0.0.0.0:4569}, 16): Address already in use
tar: This does not look like a tar archive
tar: Exiting with failure status due to previous errors
WSREP_SST: [ERROR] Error while getting data from donor node:  exit codes: 1 2 (20200331 16:26:05.192)
WSREP_SST: [ERROR] Cleanup after exit with status:32 (20200331 16:26:05.193)
WSREP_SST: [INFO] Removing the sst_in_progress file (20200331 16:26:05.195)
2020-03-31 16:26:05 139748597298944 [ERROR] WSREP: Process completed with error: wsrep_sst_xtrabackup --role 'joiner' --address '10.10.10.21:4569' --datadir '/data/db/'   --parent '4326' --binlog '/var/log/mysql/mariadb-bin' : 32 (Broken pipe)
2020-03-31 16:26:05 139748597298944 [ERROR] WSREP: Failed to read uuid:seqno and wsrep_gtid_domain_id from joiner script.
2020-03-31 16:26:05 139748981156096 [ERROR] WSREP: SST failed: 32 (Broken pipe)
2020-03-31 16:26:05 139748981156096 [ERROR] Aborting

Бесплатно посоветовать мне. Спасибо за чтение.

Добро пожаловать на сайт PullRequest, где вы можете задавать вопросы и получать ответы от других членов сообщества.
...