Etcd2 fails to start on the CoreOS node

coreosetcd

I am trying to start etcd2 in my CoreOS node.

I have this in my cloud-config:

coreos:
  etcd2:
    discovery: https://discovery.etcd.io/new?size=1
    advertise-client-urls: http://127.0.0.1:2379,http://127.0.0.1:4001
    initial-advertise-peer-urls: http://127.0.0.1:2380
    listen-client-urls: http://0.0.0.0:2379,http://0.0.0.0:4001
    listen-peer-urls: http://127.0.0.1:2380

After the installation, when I boot the system I get the error (according to the logs):

etcdmain: invalid character 'p' after top-level value

and etcd2 fails to start.

What does that mean?
I have followed the guides on https://coreos.com/os/docs/latest/cloud-config.html and https://coreos.com/os/docs/latest/cluster-discovery.html.

EDIT

Node 1

coreos:
  etcd2:
    name: coreos1
    discovery: https://discovery.etcd.io/2d585793b364cf8985ca7a983d6c52e3
    advertise-client-urls: http://127.0.0.1:2379,http://127.0.0.1:4001
    initial-advertise-peer-urls: http://127.0.0.1:2380
    listen-client-urls: http://0.0.0.0:2379,http://0.0.0.0:4001
    listen-peer-urls: http://127.0.0.1:2380

Node 2

coreos:
  etcd2:
    name: coreos2
    discovery: https://discovery.etcd.io/2d585793b364cf8985ca7a983d6c52e3
    advertise-client-urls: http://127.0.0.1:2379,http://127.0.0.1:4001
    initial-advertise-peer-urls: http://127.0.0.1:2380
    listen-client-urls: http://0.0.0.0:2379,http://0.0.0.0:4001
    listen-peer-urls: http://127.0.0.1:2380

coreos1> journalctl -u etcd2:

Sep 21 20:10:31 coreos1 etcd2[671]: 2015/09/21 20:10:31 discovery: found self e276d5b4c276a19d in the cluster
Sep 21 20:10:31 coreos1 etcd2[671]: 2015/09/21 20:10:31 discovery: found 1 peer(s), waiting for 1 more
Sep 21 20:11:31 coreos1 etcd2[671]: 2015/09/21 20:11:31 discovery: error #0: client: etcd member https://discovery.etcd.io returns server error [Gateway Timeout]
Sep 21 20:11:31 coreos1 etcd2[671]: 2015/09/21 20:11:31 discovery: waiting for other nodes: error connecting to https://discovery.etcd.io, retrying in 8m32s

coreos2> journalctl -u etcd2:

Sep 21 20:11:43 coreos2 systemd[1]: Starting etcd2...
Sep 21 20:11:43 coreos2 etcd2[1515]: 2015/09/21 20:11:43 etcdmain: etcd Version: 2.1.2
Sep 21 20:11:43 coreos2 etcd2[1515]: 2015/09/21 20:11:43 etcdmain: Git SHA: ff8d1ec
Sep 21 20:11:43 coreos2 etcd2[1515]: 2015/09/21 20:11:43 etcdmain: Go Version: go1.4.2
Sep 21 20:11:43 coreos2 etcd2[1515]: 2015/09/21 20:11:43 etcdmain: Go OS/Arch: linux/amd64
Sep 21 20:11:43 coreos2 etcd2[1515]: 2015/09/21 20:11:43 etcdmain: setting maximum number of CPUs to 1, total number of available CPUs is 1
Sep 21 20:11:43 coreos2 etcd2[1515]: 2015/09/21 20:11:43 etcdmain: listening for peers on http://127.0.0.1:2380
Sep 21 20:11:43 coreos2 etcd2[1515]: 2015/09/21 20:11:43 etcdmain: listening for client requests on http://0.0.0.0:2379
Sep 21 20:11:43 coreos2 etcd2[1515]: 2015/09/21 20:11:43 etcdmain: listening for client requests on http://0.0.0.0:4001
Sep 21 20:11:45 coreos2 etcd2[1515]: 2015/09/21 20:11:45 etcdmain: stopping listening for client requests on http://0.0.0.0:4001
Sep 21 20:11:45 coreos2 etcd2[1515]: 2015/09/21 20:11:45 etcdmain: stopping listening for client requests on http://0.0.0.0:2379
Sep 21 20:11:45 coreos2 etcd2[1515]: 2015/09/21 20:11:45 etcdmain: stopping listening for peers on http://127.0.0.1:2380
Sep 21 20:11:45 coreos2 etcd2[1515]: 2015/09/21 20:11:45 etcdmain: member "core2" has previously registered with discovery service token (https://discovery.etcd.io/2d585793b364cf8985ca7a983d6c52e3).
Sep 21 20:11:45 coreos2 etcd2[1515]: 2015/09/21 20:11:45 etcdmain: But etcd could not find vaild cluster configuration in the given data dir (/var/lib/etcd2).
Sep 21 20:11:45 coreos2 etcd2[1515]: 2015/09/21 20:11:45 etcdmain: Please check the given data dir path if the previous bootstrap succeeded
Sep 21 20:11:45 coreos2 etcd2[1515]: 2015/09/21 20:11:45 etcdmain: or use a new discovery token if the previous bootstrap failed.
Sep 21 20:11:45 coreos2 systemd[1]: etcd2.service: Main process exited, code=exited, status=1/FAILURE
Sep 21 20:11:45 coreos2 systemd[1]: etcd2.service: Unit entered failed state.
Sep 21 20:11:45 coreos2 systemd[1]: etcd2.service: Failed with result 'exit-code'.

Best Answer

Your discovery URL is incorrect - the URL https://discovery.etcd.io/new?size=1 is used to obtain a fresh discovery URL you can use in your configuration. Do this once manually, e.g. with curl:

curl --silent -H "Accept: text/plain" https://discovery.etcd.io/new?size=1

This will return a URL like this:

https://discovery.etcd.io/a93e30ebf9375f2385fef54c83b2840d

It's a URL like that which should be your discovery URL. Always use a fresh discovery URL whenever you build a new cluster.

Related Topic