文档章节

ceph osd盘挂掉,无法修复

加油2018
 加油2018
发布于 2014/12/24 10:57
字数 1686
阅读 1884
收藏 0

由于网络更换ip段,导致ceph启动时monitor进程无法启动,解决了这个问题后,ceph能够启动,但一块osd盘坏掉了,错误日志如下:

2014-12-24 10:53:30.353262 7f3fbbd78800  0 ceph version 0.87 (c51c8f9d80fa4e0168aa52685b8de40e42758578), process ceph-osd, pid 9794
2014-12-24 10:53:30.359829 7f3fbbd78800  0 filestore(/var/lib/ceph/osd/ceph-4) backend xfs (magic 0x58465342)
2014-12-24 10:53:30.536660 7f3fbbd78800  0 genericfilestorebackend(/var/lib/ceph/osd/ceph-4) detect_features: FIEMAP ioctl is supported and appears to work
2014-12-24 10:53:30.536683 7f3fbbd78800  0 genericfilestorebackend(/var/lib/ceph/osd/ceph-4) detect_features: FIEMAP ioctl is disabled via 'filestore fiemap' config option
2014-12-24 10:53:30.537600 7f3fbbd78800  0 genericfilestorebackend(/var/lib/ceph/osd/ceph-4) detect_features: syscall(SYS_syncfs, fd) fully supported
2014-12-24 10:53:30.537684 7f3fbbd78800  0 xfsfilestorebackend(/var/lib/ceph/osd/ceph-4) detect_feature: extsize is disabled by conf
2014-12-24 10:53:30.537684 7f3fbbd78800  0 xfsfilestorebackend(/var/lib/ceph/osd/ceph-4) detect_feature: extsize is disabled by conf
2014-12-24 10:53:30.592079 7f3fbbd78800  0 filestore(/var/lib/ceph/osd/ceph-4) mount: enabling WRITEAHEAD journal mode: checkpoint is not enabled
2014-12-24 10:53:40.394856 7f3fbbd78800 -1 os/FileStore.cc: In function 'int FileStore::_check_replay_guard(int, const SequencerPosition&)' thread 7f3fbbd78800 time 2014-12-24 10:53:40.391498
os/FileStore.cc: 2181: FAILED assert(!m_filestore_fail_eio || r != -5)

 ceph version 0.87 (c51c8f9d80fa4e0168aa52685b8de40e42758578)
 1: (FileStore::_check_replay_guard(int, SequencerPosition const&)+0x2d3) [0x730633]
 2: (FileStore::_check_replay_guard(coll_t, ghobject_t, SequencerPosition const&)+0x148) [0x742918]
 3: (FileStore::_do_transaction(ObjectStore::Transaction&, unsigned long, int, ThreadPool::TPHandle*)+0x58a4) [0x75e7a4]
 4: (FileStore::_do_transactions(std::list<ObjectStore::Transaction*, std::allocator<ObjectStore::Transaction*> >&, unsigned long, ThreadPool::TPHandle*)+0x64) [0x763894]
 5: (JournalingObjectStore::journal_replay(unsigned long)+0x47c) [0x77578c]
 6: (FileStore::mount()+0x2baa) [0x751d4a]
 7: (OSD::do_convertfs(ObjectStore*)+0x1a) [0x64e75a]
 8: (main()+0x33e4) [0x613044]
 9: (__libc_start_main()+0xfd) [0x31e6a1ed1d]
 10: /usr/bin/ceph-osd() [0x60ec19]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

--- begin dump of recent events ---
   -21> 2014-12-24 10:53:30.348526 7f3fbbd78800  5 asok(0x5278230) register_command perfcounters_dump hook 0x5200010
   -20> 2014-12-24 10:53:30.348569 7f3fbbd78800  5 asok(0x5278230) register_command 1 hook 0x5200010
   -19> 2014-12-24 10:53:30.348579 7f3fbbd78800  5 asok(0x5278230) register_command perf dump hook 0x5200010
   -18> 2014-12-24 10:53:30.348590 7f3fbbd78800  5 asok(0x5278230) register_command perfcounters_schema hook 0x5200010
   -17> 2014-12-24 10:53:30.348600 7f3fbbd78800  5 asok(0x5278230) register_command 2 hook 0x5200010
   -19> 2014-12-24 10:53:30.348579 7f3fbbd78800  5 asok(0x5278230) register_command perf dump hook 0x5200010
   -18> 2014-12-24 10:53:30.348590 7f3fbbd78800  5 asok(0x5278230) register_command perfcounters_schema hook 0x5200010
   -17> 2014-12-24 10:53:30.348600 7f3fbbd78800  5 asok(0x5278230) register_command 2 hook 0x5200010
   -16> 2014-12-24 10:53:30.348609 7f3fbbd78800  5 asok(0x5278230) register_command perf schema hook 0x5200010

   -15> 2014-12-24 10:53:30.348618 7f3fbbd78800  5 asok(0x5278230) register_command config show hook 0x5200010
asok(0x5278230) register_command config show hook 0x5200010
   -14> 2014-12-24 10:53:30.348629 7f3fbbd78800  5    -14> 2014-12-24 10:53:30.348629 7f3fbbd78800  5 asok(0x5278230) register_command config set hook 0x5200010
   -13> 2014-12-24 10:53:30.348638 7f3fbbd78800  5 asok(0x5278230) register_command config get hook 0x5200010asok(0x5278230) register_command config get hook 0x5200010
   -12> 2014-12-24 10:53:30.348647 7f3fbbd78800  5 asok(0x5278230) register_command config diff hook 0x5200010asok(0x5278230) register_command config diff hook 0x5200010
   -11> 2014-12-24 10:53:30.348657 7f3fbbd78800  5 asok(0x5278230) register_command log flush hook 0x5200010asok(0x5278230) register_command log flush hook 0x5200010
   -10> 2014-12-24 10:53:30.348666 7f3fbbd78800  5 asok(0x5278230) register_command log dump hook 0x5200010asok(0x5278230) register_command log dump hook 0x5200010
    -9> 2014-12-24 10:53:30.348675 7f3fbbd78800  5 asok(0x5278230) register_command log reopen hook 0x5200010
    -8> 2014-12-24 10:53:30.353262 7f3fbbd78800  0 ceph version 0.87 (c51c8f9d80fa4e0168aa52685b8de40e42758578), process ceph-osd, pid 9794
    -7> 2014-12-24 10:53:30.355214 7f3fbbd78800  1 finished global_init_daemonize
    -6> 2014-12-24 10:53:30.359829 7f3fbbd78800  0 filestore(/var/lib/ceph/osd/ceph-4) backend xfs (magic 0x58465342)
    -5> 2014-12-24 10:53:30.536660 7f3fbbd78800  0 genericfilestorebackend(/var/lib/ceph/osd/ceph-4) detect_features: FIEMAP ioctl is supported and appears to work
    -4> 2014-12-24 10:53:30.536683 7f3fbbd78800  0     -5> 2014-12-24 10:53:30.536660 7f3fbbd78800  0 genericfilestorebackend(/var/lib/ceph/osd/ceph-4) detect_features: FIEMAP ioctl is supported and appears to work
    -4> 2014-12-24 10:53:30.536683 7f3fbbd78800  0 genericfilestorebackend(/var/lib/ceph/osd/ceph-4) detect_features: FIEMAP ioctl is disabled via 'filestore fiemap' config option
    -3> 2014-12-24 10:53:30.537600 7f3fbbd78800  0 genericfilestorebackend(/var/lib/ceph/o    -3> 2014-12-24 10:53:30.537600 7f3fbbd78800  0 genericfilestorebackend(/var/lib/ceph/osd/ceph-4) detect_features: syscall(SYS_syncfs, fd) fully supported
    -2> 2014-12-24 10:53:30.537684 7f3fbbd78800  0 xfsfilestorebackend(/var/lib/ceph/osd/ceph-4) detect_feature: extsize is disabled by conf

    -1> 2014-12-24 10:53:30.592079 7f3fbbd78800  0 filestore(/var/lib/ceph/osd/ceph-4) mount: enabling WRITEAHEAD journal mode: checkpoint is not enabled

     0> 2014-12-24 10:53:40.394856 7f3fbbd78800 -1 os/FileStore.cc: In function 'int FileStore::_check_replay_guard(int, const SequencerPosition&)' thread 7f3fbbd78800 time 2014-12-24 10:53:40.391498
os/FileStore.cc: 2181: FAILED assert(!m_filestore_fail_eio || r != -5)

 ceph version 0.87 (c51c8f9d80fa4e0168aa52685b8de40e42758578)
 1: (FileStore::_check_replay_guard(int, SequencerPosition const&)+0x2d3) [0x730633]
 2: (FileStore::_check_replay_guard(coll_t, ghobject_t, SequencerPosition const&)+0x148) [0x742918]
 3: (FileStore::_do_transaction(ObjectStore::Transaction&, unsigned long, int, ThreadPool::TPHandle*)+0x58a4) [0x75e7a4]
 4: (FileStore::_do_transactions(std::list<ObjectStore::Transaction*, std::allocator<ObjectStore::Transaction*> >&, unsigned long, ThreadPool::TPHandle*)+0x64) [0x763894]
 5: (JournalingObjectStore::journal_replay(unsigned long)+0x47c) [0x77578c]
 6: (FileStore::mount()+0x2baa) [0x751d4a]
 7: (OSD::do_convertfs(ObjectStore*)+0x1a) [0x64e75a]
 8: (main()+0x33e4) [0x613044]
 9: (__libc_start_main()+0xfd) [0x31e6a1ed1d]
 10: /usr/bin/ceph-osd() [0x60ec19]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

--- logging levels ---
   0/ 5 noneos/FileStore.cc: In function 'int FileStore::_check_replay_guard(int, const SequencerPosition&)' thread 7f3fbbd78800 time 2014-12-24 10:53:40.391498
os/FileStore.cc: 2181: FAILED assert(!m_filestore_fail_eio || r != -5)

 ceph version 0.87 (c51c8f9d80fa4e0168aa52685b8de40e42758578)
 1: (FileStore::_check_replay_guard(int, SequencerPosition const&)+0x2d3) [0x730633]
 2: (FileStore::_check_replay_guard(coll_t, ghobject_t, SequencerPosition const&)+0x148) [0x742918]
 3: (FileStore::_do_transaction(ObjectStore::Transaction&, unsigned long, int, ThreadPool::TPHandle*)+0x58a4) [0x75e7a4]
 4: (FileStore::_do_transactions(std::list<ObjectStore::Transaction*, std::allocator<ObjectStore::Transaction*> >&, unsigned long, ThreadPool::TPHandle*)+0x64) [0x763894]
 5: (JournalingObjectStore::journal_replay(unsigned long)+0x47c) [0x77578c]
 6: (FileStore::mount()+0x2baa) [0x751d4a]
 7: (OSD::do_convertfs(ObjectStore*)+0x1a) [0x64e75a]
 8: (main()+0x33e4) [0x613044]
 9: (__libc_start_main()+0xfd) [0x31e6a1ed1d]
 10: /usr/bin/ceph-osd() [0x60ec19]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

--- logging levels ---
   0/ 5 none
   0/ 0 lockdep
   0/ 0 context
   0/ 0 crush
   0/ 0 lockdep
   0/ 0 context
   0/ 0 crush
   0/ 0 mds
   1/ 5 mds_balancer
   1/ 5 mds_balancer
   1/ 5 mds_locker   1/ 5 mds_locker
   1/ 5 mds_log   1/ 5 mds_log
   1/ 5 mds_log_expire

   1/ 5 mds_migrator   1/ 5 mds_migrator
   0/ 0 buffer

   0/ 0 timer

   0/ 1 filer

   0/ 1 striper

   0/ 1 objecter

   0/ 5 rados

   0/ 0 rbd

   0/ 5 rbd_replay

   0/ 0 journaler

   0/ 5 objectcacher
   0/ 5 client   0/ 5 client
   0/ 0 osd   0/ 0 osd
   0/ 0 optracker   0/ 0 optracker
   0/ 0 objclass   0/ 0 objclass
   0/ 0 filestore   0/ 0 filestore
   1/ 3 keyvaluestore
   1/ 3 keyvaluestore
   0/ 0 journal

   0/ 0 ms

   0/ 0 mon
   0/ 0 monc
   0/ 0 paxos
   0/ 0 tp

   0/ 0 monc
   0/ 0 paxos
   0/ 0 tp
   0/ 0 auth
   1/ 5 crypto
   1/ 5 crypto
   0/ 0 finisher

   0/ 0 heartbeatmap

   0/ 0 perfcounter

   1/ 5 rgw

   1/10 civetweb

   1/ 5 javaclient

   0/ 0 asok

   0/ 0 throttle

   0/ 0 refs

  -2/-2 (syslog threshold)

  -1/-1 (stderr threshold)

  max_recent     10000

  max_new         1000

  log_file /var/log/ceph/ceph-osd.4.log

--- end dump of recent events ---
2014-12-24 10:53:40.400275 7f3fbbd78800 -1 *** Caught signal (Aborted) **
 in thread 7f3fbbd78800

 ceph version 0.87 (c51c8f9d80fa4e0168aa52685b8de40e42758578)
 1: /usr/bin/ceph-osd() [0xa02425]
 2: /lib64/libpthread.so.0() [0x31e6e0f710]
 3: (gsignal()+0x35) [0x31e6a32925]
 4: (abort()+0x175) [0x31e6a34105]
 5: (__gnu_cxx::__verbose_terminate_handler()+0x12d) [0x31f2abea7d]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

--- begin dump of recent events ---
     0> 2014-12-24 10:53:40.400275 7f3fbbd78800 -1 *** Caught signal (Aborted) **
 in thread 7f3fbbd78800

 ceph version 0.87 (c51c8f9d80fa4e0168aa52685b8de40e42758578)
 1: /usr/bin/ceph-osd() [0xa02425]
 2: /lib64/libpthread.so.0() [0x31e6e0f710]
 3: (gsignal()+0x35) [0x31e6a32925]
 4: (abort()+0x175) [0x31e6a34105]
 5: (__gnu_cxx::__verbose_terminate_handler()+0x12d) [0x31f2abea7d]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

--- logging levels ---
   0/ 5 none
   0/ 0 lockdep
   0/ 0 context
   0/ 0 crush
   0/ 0 mds
   1/ 5 mds_balancer

   0/ 0 lockdep
   0/ 0 context
   0/ 0 crush
   0/ 0 mds
   1/ 5 mds_balancer
   1/ 5 mds_locker
   1/ 5 mds_log
   1/ 5 mds_log_expire
   1/ 5 mds_log_expire
   1/ 5 mds_migrator
   1/ 5 mds_migrator
   0/ 0 buffer

   0/ 0 timer
   0/ 1 filer

   0/ 1 striper

   0/ 1 objecter

   0/ 5 rados

   0/ 0 rbd

   0/ 5 rbd_replay

   0/ 0 journaler

   0/ 5 objectcacher

   0/ 5 client

   0/ 0 osd

   0/ 0 optracker

   0/ 0 objclass

   0/ 0 filestore

   1/ 3 keyvaluestore

   0/ 0 journal

   0/ 0 ms

   0/ 0 mon

   0/ 0 monc

   0/ 0 paxos
   0/ 0 tp   0/ 0 tp
   0/ 0 auth   0/ 0 auth
   1/ 5 crypto   1/ 5 crypto
   0/ 0 finisher   0/ 0 finisher
   0/ 0 heartbeatmap
   0/ 0 perfcounter   0/ 0 heartbeatmap
   0/ 0 perfcounter
   1/ 5 rgw

   1/10 civetweb

   1/ 5 javaclient

   0/ 0 asok
   0/ 0 throttle
   0/ 0 refs
  -2/-2 (syslog threshold)
  -1/-1 (stderr threshold)
  max_recent     10000
  max_new         1000
  0/ 0 asok
   0/ 0 throttle
   0/ 0 refs
  -2/-2 (syslog threshold)
  -1/-1 (stderr threshold)
  max_recent     10000
  max_new         1000
  log_file /var/log/ceph/ceph-osd.4.log
--- end dump of recent events ---



暂时无法解决,有遇到过类似问题的朋友吗?

© 著作权归作者所有

共有 人打赏支持
加油2018
粉丝 150
博文 276
码字总数 243077
作品 0
海淀
架构师
私信 提问
加载中

评论(4)

haiquan517
haiquan517
首先查看monmap表, 如果都改过来了, 尝试重建journallog
加油2018
加油2018

引用来自“oscfox”的评论

这还真没遇到过,不过更换ip段不需要stop整个集群,重新push config 然后再重启吗?
公司搬家,所有服务器都分配了新的ip。stop了,修改配置后start,遇到了上面的问题。
Yashin
Yashin
这还真没遇到过,不过更换ip段不需要stop整个集群,重新push config 然后再重启吗?
叶秀兰
叶秀兰
这个去讨论区问比较靠谱吧,比较多人看到,可以讨论。@oscfox
ip改变引起的ceph monitor异常及osd盘崩溃的总结

公司搬家,所有服务器的ip改变。对ceph服务器配置好ip后启动,发现monitor进程启动失败,monitor进程总是试图绑定到以前的ip地址,那当然不可能成功了。开始以为服务器的ip设置有问题,在改变...

加油2018
2014/12/25
0
6
Openstack之Ceph集群操作

Oepnstack之CEPH系列是根据Ceph Cookbook整理的笔记,分为以下几个部分: 1. 《Ceph简介》 2. 《Ceph集群操作》 3. 《Ceph块设备管理与Openstack配置》 4. 《深入Ceph》 5. 《ceph优化与性能...

hiubuntu
2016/12/01
0
0
CEPH 可靠性的计算方法分析

原文出处:RaySun的博客 在开始正文之前,首先要感谢UnitedStack工程师朱荣泽对这篇博文的大力帮助和悉心指教。本文主要针对UnitedStack公司在巴黎峰 会上对Ceph可靠性的计算方法(https://ww...

红薯
2015/01/25
3K
0
开源实践分享:Ceph bluestore部署实践

Ceph bluestore部署 首先为大家分享Ceph bluestore具体该如何部署,使用环境如下 • 单节点 • CentOS 7.4 • 使用 ceph-deploy 进行部署 准备 centos 7.4 系统的机器,并配置好 ssh 的无密码...

OpenInfra
2018/05/24
0
0
ceph分布式存储实战(3)——ceph内部组件

一、RADOS(Reliable Autonomic Distributed Object Storage) 1、rados的意思是 可靠、自动、分布式对象存储(可靠的自主分布式对象存储) 2、rados的作用 1、rados是ceph集群的基础,是cep...

TtrToby
2018/01/05
0
0

没有更多内容

加载失败,请刷新页面

加载更多

详解webpack-dev-server的简单使用

webpack-dev-server是一个小型的Node.js Express服务器,它使用webpack-dev-middleware来服务于webpack的包,除此自外,它还有一个通过Sock.js来连接到服务器的微型运行时. 我们来看一下下面的...

前端攻城老湿
27分钟前
0
0
深度解析JavaScript事件对象

这篇文章主要介绍了JavaScript事件对象,结合实例形式深入分析了javascript DOM、IE及其他浏览器相关事件对象操作技巧与注意事项,写的十分的全面细致,具有一定的参考价值,对此有需要的朋友可...

前端攻城小牛
28分钟前
1
0
Android下拉刷新开源框架

添加依赖 //下拉刷新 implementation 'com.jcodecraeer:xrecyclerview:1.5.9' xml引用 <com.jcodecraeer.xrecyclerview.XRecyclerView android:id="@+id/act_xrecycler......

lanyu96
35分钟前
1
0
Linux内核中ioremap映射的透彻理解

几乎每一种外设都是通过读写设备上的寄存器来进行的,通常包括控制寄存器、状态寄存器和数据寄存器三大类,外设的寄存器通常被连续地编址。根据CPU体系结构的不同,CPU对IO端口的编址方式有两...

天王盖地虎626
39分钟前
2
0
Collection中的之retainAll()方法的理解

//在jdkapi中的方法,说明返回值为boolean类型, boolean retainAll(Collection<?> c) ; //api中给的注释 //Retains only the elements in this list that are contained in the specified......

南桥北木
42分钟前
1
0

没有更多内容

加载失败,请刷新页面

加载更多

返回顶部
顶部