Dobrica Pavlinusic [Sat, 28 Jul 2018 08:09:08 +0000 (10:09 +0200)]
create rbd snap from instance disk
Dobrica Pavlinusic [Sat, 28 Jul 2018 07:43:48 +0000 (09:43 +0200)]
format image snapshots in copy/paste friendly format
Dobrica Pavlinusic [Fri, 27 Jul 2018 22:11:54 +0000 (00:11 +0200)]
list all instance rbd disks and snapshots
Dobrica Pavlinusic [Thu, 26 Jul 2018 14:08:05 +0000 (16:08 +0200)]
document windows 2003 import into ganeti
Dobrica Pavlinusic [Wed, 25 Jul 2018 14:14:05 +0000 (16:14 +0200)]
move windows-backup.txt to doc
Dobrica Pavlinusic [Wed, 25 Jul 2018 14:04:20 +0000 (16:04 +0200)]
added tags
Dobrica Pavlinusic [Wed, 25 Jul 2018 14:03:00 +0000 (16:03 +0200)]
move grml documentation into doc dir
Dobrica Pavlinusic [Wed, 25 Jul 2018 14:00:52 +0000 (16:00 +0200)]
create /dev/shm/instance, fix domain
Dobrica Pavlinusic [Tue, 17 Jul 2018 13:36:08 +0000 (15:36 +0200)]
use only layering feature of ceph until upgrade
Dobrica Pavlinusic [Tue, 17 Jul 2018 13:08:20 +0000 (15:08 +0200)]
document ganeti upgrade to stretch
Dobrica Pavlinusic [Mon, 16 Jul 2018 07:45:58 +0000 (09:45 +0200)]
show also unknown lines
Dobrica Pavlinusic [Mon, 2 Jul 2018 08:23:26 +0000 (10:23 +0200)]
document booting grml cdrom image for recovery
Dobrica Pavlinusic [Sat, 23 Jun 2018 14:00:38 +0000 (16:00 +0200)]
use /dev/shm/mac-ip-vlan from enesej to find instance ip
Dobrica Pavlinusic [Mon, 29 Jan 2018 15:06:25 +0000 (16:06 +0100)]
process each instance only once
Useful for instances with more than one disk
Dobrica Pavlinusic [Sun, 28 Jan 2018 06:17:48 +0000 (07:17 +0100)]
sort output
Dobrica Pavlinusic [Sat, 27 Jan 2018 16:56:08 +0000 (17:56 +0100)]
added load count to diagnose firmware errors
Dobrica Pavlinusic [Sat, 27 Jan 2018 16:55:17 +0000 (17:55 +0100)]
added Load cycle count
Dobrica Pavlinusic [Sat, 27 Jan 2018 16:53:49 +0000 (17:53 +0100)]
rebuild broken secondary node drbd
This is really ugly, and requires instance reboot, but it allows
you to resize your disk array and do single transfer. YMMV
Dobrica Pavlinusic [Sat, 27 Jan 2018 12:30:21 +0000 (13:30 +0100)]
added OSFAMILY for snf-image
Dobrica Pavlinusic [Sat, 27 Jan 2018 12:29:39 +0000 (13:29 +0100)]
more compact output
Dobrica Pavlinusic [Fri, 26 Jan 2018 15:13:24 +0000 (16:13 +0100)]
iostat: replace disks with instance:disk and hide idle ones
Dobrica Pavlinusic [Tue, 23 Jan 2018 15:08:52 +0000 (16:08 +0100)]
report drives without tests run on them
Dobrica Pavlinusic [Tue, 23 Jan 2018 15:01:05 +0000 (16:01 +0100)]
clean argument purges /dev/shm/*smart* cache files
Dobrica Pavlinusic [Mon, 22 Jan 2018 16:53:38 +0000 (17:53 +0100)]
Merge branch 'master' of https://github.com/ffzg/gnt-info
Dobrica Pavlinusic [Mon, 22 Jan 2018 16:50:14 +0000 (17:50 +0100)]
don't use host tag so instances are hidden from chronograf hosts view
Dobrica Pavlinusic [Fri, 19 Jan 2018 12:01:07 +0000 (13:01 +0100)]
Merge branch 'master' of https://github.com/ffzg/gnt-info
Dobrica Pavlinusic [Fri, 19 Jan 2018 11:04:23 +0000 (12:04 +0100)]
added serial, use cache output in /dev/shm
Dobrica Pavlinusic [Fri, 19 Jan 2018 11:03:59 +0000 (12:03 +0100)]
create cache output in /dev/shm/gnt-smart
Dobrica Pavlinusic [Thu, 18 Jan 2018 10:52:05 +0000 (11:52 +0100)]
fix hwraid.list path
Dobrica Pavlinusic [Thu, 18 Jan 2018 10:49:57 +0000 (11:49 +0100)]
fix smart megaraid detection pattern
Dobrica Pavlinusic [Thu, 18 Jan 2018 10:44:58 +0000 (11:44 +0100)]
cleanup log output
root [Wed, 17 Jan 2018 18:22:45 +0000 (19:22 +0100)]
cleanup Firmware version 1.22.52-1909 built on Sep 21 2012 at 15:29:16
Dobrica Pavlinusic [Wed, 17 Jan 2018 18:19:00 +0000 (19:19 +0100)]
Merge branch 'master' of github.com:ffzg/gnt-info
Dobrica Pavlinusic [Wed, 17 Jan 2018 18:18:39 +0000 (19:18 +0100)]
LSI controller boot log
Dobrica Pavlinusic [Wed, 17 Jan 2018 16:45:36 +0000 (17:45 +0100)]
correctly format output with errors
Dobrica Pavlinusic [Wed, 17 Jan 2018 16:40:00 +0000 (17:40 +0100)]
check if we need to re-run smart tests
Dobrica Pavlinusic [Wed, 17 Jan 2018 10:57:07 +0000 (11:57 +0100)]
better detect megaraid controllers
Dobrica Pavlinusic [Mon, 15 Jan 2018 18:34:13 +0000 (19:34 +0100)]
Merge branch 'master' of https://github.com/ffzg/gnt-info
Dobrica Pavlinusic [Mon, 15 Jan 2018 18:32:58 +0000 (19:32 +0100)]
generate full report by default
just add '^# 1' as argument to get quick report
Dobrica Pavlinusic [Mon, 15 Jan 2018 16:52:26 +0000 (17:52 +0100)]
report test remaining if self-test is running
Dobrica Pavlinusic [Sun, 14 Jan 2018 13:46:11 +0000 (14:46 +0100)]
Merge remote-tracking branch 'lib15/lib15'
Dobrica Pavlinusic [Sun, 14 Jan 2018 13:41:26 +0000 (14:41 +0100)]
use smartctl on megaraid controller only once
If you export multiple vds from controller, they will all have
full set of disks attached from smartctl's point of view, so there
is no need to run this more than once
Dobrica Pavlinusic [Sun, 14 Jan 2018 13:23:40 +0000 (14:23 +0100)]
report smart for whole cluster
Dobrica Pavlinusic [Sun, 14 Jan 2018 12:55:46 +0000 (13:55 +0100)]
execute SMART smartctl params on each drive
e.g. SMART="-t long" ./smart-megaraid.sh
Dobrica Pavlinusic [Sun, 14 Jan 2018 12:28:42 +0000 (13:28 +0100)]
smart script is not ganeti specific
Dobrica Pavlinusic [Sun, 14 Jan 2018 11:51:25 +0000 (12:51 +0100)]
report smart status from megaraid controller
Dobrica Pavlinusic [Sun, 14 Jan 2018 10:29:02 +0000 (11:29 +0100)]
display last selftest log for each disk
Dobrica Pavlinusic [Tue, 5 Dec 2017 12:08:10 +0000 (13:08 +0100)]
retrieve args from rsync server for each instance if it exists
This allows us server-side --exclude 'var/lib/php5/sess_*' and similar
directives which can't really be specified in server rsyncd.conf because
it will produce rsync error, and not remove snapshot at end
(and prevent future backups)
Dobrica Pavlinusic [Tue, 7 Nov 2017 07:32:30 +0000 (08:32 +0100)]
display vnc connection info over ssh for instance
Dobrica Pavlinusic [Thu, 5 Oct 2017 21:13:27 +0000 (23:13 +0200)]
ask ganeti for vg, execute zfs snap using ssh command=
Dobrica Pavlinusic [Thu, 5 Oct 2017 16:44:49 +0000 (18:44 +0200)]
use backup environment var for top-level backup dir
Dobrica Pavlinusic [Sun, 1 Oct 2017 07:03:17 +0000 (09:03 +0200)]
find instances running with closed lvs which require gnt-instance replace-disks
Dobrica Pavlinusic [Sat, 30 Sep 2017 13:33:33 +0000 (15:33 +0200)]
prefix output with machine name
Dobrica Pavlinusic [Wed, 27 Sep 2017 15:59:17 +0000 (17:59 +0200)]
fix interval skip
Dobrica Pavlinusic [Tue, 26 Sep 2017 18:45:46 +0000 (20:45 +0200)]
report power supply metrics
Dobrica Pavlinusic [Thu, 8 Jun 2017 11:24:14 +0000 (13:24 +0200)]
copy acls also to backup
Dobrica Pavlinusic [Sun, 28 May 2017 19:09:41 +0000 (21:09 +0200)]
Merge branch 'master' of https://github.com/ffzg/gnt-info
Dobrica Pavlinusic [Sun, 28 May 2017 19:07:31 +0000 (21:07 +0200)]
use snf-image to backup windows machines
root [Tue, 23 May 2017 07:31:42 +0000 (09:31 +0200)]
insert 0 for vlan if unknown
Dobrica Pavlinusic [Mon, 22 May 2017 20:43:33 +0000 (22:43 +0200)]
watch drbd and mdstat sync
Dobrica Pavlinusic [Mon, 22 May 2017 19:05:04 +0000 (21:05 +0200)]
rsync extended attrs to preserve capabilities
Dobrica Pavlinusic [Mon, 22 May 2017 19:04:08 +0000 (21:04 +0200)]
don't exit if watched job failed
Dobrica Pavlinusic [Mon, 22 May 2017 19:03:02 +0000 (21:03 +0200)]
sleep only if needed
Dobrica Pavlinusic [Thu, 20 Apr 2017 13:23:22 +0000 (15:23 +0200)]
don't send data to influx from gnt-i
Dobrica Pavlinusic [Thu, 20 Apr 2017 13:21:59 +0000 (15:21 +0200)]
prefix measurement with ganeti_
This allows us to insert data into any database (e.g. telegraf)
Also check if wait is larger than interval to fix negative wait time.
Dobrica Pavlinusic [Tue, 14 Feb 2017 11:35:53 +0000 (12:35 +0100)]
divide all values by $INTERVAL so we work for intervals which are > 1 s
Dobrica Pavlinusic [Mon, 13 Feb 2017 11:05:49 +0000 (12:05 +0100)]
sleep correct amount of time regardless of processing time
Dobrica Pavlinusic [Mon, 13 Feb 2017 10:36:11 +0000 (11:36 +0100)]
make timestamp integer
Dobrica Pavlinusic [Mon, 13 Feb 2017 10:34:11 +0000 (11:34 +0100)]
emit timestamp from local node
Dobrica Pavlinusic [Thu, 9 Feb 2017 13:47:48 +0000 (14:47 +0100)]
rsync --inplace helps copy-on-write filesystems
Dobrica Pavlinusic [Thu, 9 Feb 2017 13:47:22 +0000 (14:47 +0100)]
Merge branch 'master' of github.com:ffzg/gnt-info
Dobrica Pavlinušić [Thu, 9 Feb 2017 13:38:13 +0000 (14:38 +0100)]
Merge pull request #1 from mnalis/master
original code has problems: dies if no running jobs at the moment (gn…
Matija Nalis [Thu, 9 Feb 2017 12:15:58 +0000 (13:15 +0100)]
original code has problems: dies if no running jobs at the moment (gnt-job watch without args throws error), dies if more than one job running (gnt-job watch with multiple args throws error), hammers the server if no jobs are running (no sleep)
Dobrica Pavlinusic [Wed, 18 Jan 2017 10:58:15 +0000 (11:58 +0100)]
fix typo
Dobrica Pavlinusic [Sat, 24 Dec 2016 08:50:57 +0000 (09:50 +0100)]
more robust optional offset mount
Dobrica Pavlinusic [Sat, 24 Dec 2016 08:26:25 +0000 (09:26 +0100)]
sleep before mount, remove shell on finish
Dobrica Pavlinusic [Sat, 24 Dec 2016 08:24:43 +0000 (09:24 +0100)]
umount snapshots, remove mount directories
Dobrica Pavlinusic [Fri, 23 Dec 2016 18:01:56 +0000 (19:01 +0100)]
remove mount directory which functions as a lock
this allows us to find failed backups and never re-try for that host
(until reboot of instance and /dev/shm cleanup that is)
Dobrica Pavlinusic [Fri, 23 Dec 2016 17:50:49 +0000 (18:50 +0100)]
remove boot flag from fdisk output
this created wrong offset... sfdisk -d output is much nicer,
but I'm not sure that its dependency is worth it
Dobrica Pavlinusic [Fri, 23 Dec 2016 17:50:37 +0000 (18:50 +0100)]
read instance and disk from stdin with -
Dobrica Pavlinusic [Fri, 23 Dec 2016 17:49:32 +0000 (18:49 +0100)]
remove snapshots on all nodes in cluster
Dobrica Pavlinusic [Fri, 23 Dec 2016 17:32:13 +0000 (18:32 +0100)]
few careful shell escapes, and now it works
Dobrica Pavlinusic [Fri, 23 Dec 2016 17:20:24 +0000 (18:20 +0100)]
mount first Linux partition
Dobrica Pavlinusic [Fri, 23 Dec 2016 17:13:55 +0000 (18:13 +0100)]
specify instance and disk on command-line
Dobrica Pavlinusic [Fri, 23 Dec 2016 10:18:22 +0000 (11:18 +0100)]
mount snapshot read-only
Dobrica Pavlinusic [Fri, 23 Dec 2016 07:39:33 +0000 (08:39 +0100)]
create lv snapshot and rsync it to backup server
Dobrica Pavlinusic [Thu, 22 Dec 2016 21:49:11 +0000 (22:49 +0100)]
re-run watcher if there are more running jobs
Dobrica Pavlinusic [Wed, 21 Dec 2016 16:44:05 +0000 (17:44 +0100)]
remove hanging snapshots after gnt-backup export
Dobrica Pavlinusic [Wed, 21 Dec 2016 12:38:44 +0000 (13:38 +0100)]
backup all running instances
Dobrica Pavlinusic [Tue, 20 Dec 2016 13:30:19 +0000 (14:30 +0100)]
move backup server to lib15
Dobrica Pavlinusic [Fri, 9 Dec 2016 19:18:38 +0000 (20:18 +0100)]
make-pidfile
Dobrica Pavlinusic [Fri, 9 Dec 2016 19:16:48 +0000 (20:16 +0100)]
added cpu ticks (not divided by vcpus as cpu is)
Dobrica Pavlinusic [Fri, 9 Dec 2016 18:53:09 +0000 (19:53 +0100)]
create bug dumps on curl failure
Dobrica Pavlinusic [Fri, 9 Dec 2016 18:52:52 +0000 (19:52 +0100)]
move location generation to avoid warning
Dobrica Pavlinusic [Fri, 9 Dec 2016 18:34:45 +0000 (19:34 +0100)]
tags don't need quotes in influxdb, they are always ascii indexed in memory
Dobrica Pavlinusic [Fri, 9 Dec 2016 18:32:58 +0000 (19:32 +0100)]
document bug and add workaround
Dobrica Pavlinusic [Fri, 9 Dec 2016 10:18:03 +0000 (11:18 +0100)]
fix vlan which can't be integer since it's a tag
Dobrica Pavlinusic [Fri, 9 Dec 2016 09:41:39 +0000 (10:41 +0100)]
remove extra --background in stop which is not needed
Dobrica Pavlinusic [Fri, 9 Dec 2016 09:40:59 +0000 (10:40 +0100)]
stop/start gnt-monitor using systemd
Dobrica Pavlinusic [Thu, 8 Dec 2016 21:50:03 +0000 (22:50 +0100)]
systemd default file