Dobrica Pavlinusic [Thu, 18 Jan 2018 10:52:05 +0000 (11:52 +0100)]
fix hwraid.list path
Dobrica Pavlinusic [Thu, 18 Jan 2018 10:49:57 +0000 (11:49 +0100)]
fix smart megaraid detection pattern
Dobrica Pavlinusic [Thu, 18 Jan 2018 10:44:58 +0000 (11:44 +0100)]
cleanup log output
root [Wed, 17 Jan 2018 18:22:45 +0000 (19:22 +0100)]
cleanup Firmware version 1.22.52-1909 built on Sep 21 2012 at 15:29:16
Dobrica Pavlinusic [Wed, 17 Jan 2018 18:19:00 +0000 (19:19 +0100)]
Merge branch 'master' of github.com:ffzg/gnt-info
Dobrica Pavlinusic [Wed, 17 Jan 2018 18:18:39 +0000 (19:18 +0100)]
LSI controller boot log
Dobrica Pavlinusic [Wed, 17 Jan 2018 16:45:36 +0000 (17:45 +0100)]
correctly format output with errors
Dobrica Pavlinusic [Wed, 17 Jan 2018 16:40:00 +0000 (17:40 +0100)]
check if we need to re-run smart tests
Dobrica Pavlinusic [Wed, 17 Jan 2018 10:57:07 +0000 (11:57 +0100)]
better detect megaraid controllers
Dobrica Pavlinusic [Mon, 15 Jan 2018 18:34:13 +0000 (19:34 +0100)]
Merge branch 'master' of https://github.com/ffzg/gnt-info
Dobrica Pavlinusic [Mon, 15 Jan 2018 18:32:58 +0000 (19:32 +0100)]
generate full report by default
just add '^# 1' as argument to get quick report
Dobrica Pavlinusic [Mon, 15 Jan 2018 16:52:26 +0000 (17:52 +0100)]
report test remaining if self-test is running
Dobrica Pavlinusic [Sun, 14 Jan 2018 13:46:11 +0000 (14:46 +0100)]
Merge remote-tracking branch 'lib15/lib15'
Dobrica Pavlinusic [Sun, 14 Jan 2018 13:41:26 +0000 (14:41 +0100)]
use smartctl on megaraid controller only once
If you export multiple vds from controller, they will all have
full set of disks attached from smartctl's point of view, so there
is no need to run this more than once
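A minimal sketch of the idea, not the actual script: since every exported virtual disk shows the same physical disks, only the first block device is queried. The function name, the dry-run output, and the assumption that megaraid device ids run from 0 to N-1 are illustrative only; real ids may be sparse.

```shell
# Print (rather than run) one smartctl command per physical disk.
# $1 = number of physical disks (assumed ids 0..N-1, hypothetical)
# $2 = first block device exported by the controller; any further
#      devices passed after it are deliberately ignored.
megaraid_smart_cmds() {
    local disks="$1" first_vd="$2" n=0
    while [ "$n" -lt "$disks" ]; do
        echo "smartctl -a -d megaraid,$n $first_vd"
        n=$((n + 1))
    done
}
```

For example, `megaraid_smart_cmds 2 /dev/sda /dev/sdb` prints two commands, both against /dev/sda.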
Dobrica Pavlinusic [Sun, 14 Jan 2018 13:23:40 +0000 (14:23 +0100)]
report smart for whole cluster
Dobrica Pavlinusic [Sun, 14 Jan 2018 12:55:46 +0000 (13:55 +0100)]
execute SMART smartctl params on each drive
e.g. SMART="-t long" ./smart-megaraid.sh
Dobrica Pavlinusic [Sun, 14 Jan 2018 12:28:42 +0000 (13:28 +0100)]
smart script is not ganeti specific
Dobrica Pavlinusic [Sun, 14 Jan 2018 11:51:25 +0000 (12:51 +0100)]
report smart status from megaraid controller
Dobrica Pavlinusic [Sun, 14 Jan 2018 10:29:02 +0000 (11:29 +0100)]
display last selftest log for each disk
Dobrica Pavlinusic [Tue, 5 Dec 2017 12:08:10 +0000 (13:08 +0100)]
retrieve args from rsync server for each instance if it exists
This allows us to use server-side --exclude 'var/lib/php5/sess_*' and similar
directives which can't really be specified in server rsyncd.conf because
it will produce rsync error, and not remove snapshot at end
(and prevent future backups)
Dobrica Pavlinusic [Tue, 7 Nov 2017 07:32:30 +0000 (08:32 +0100)]
display vnc connection info over ssh for instance
Dobrica Pavlinusic [Thu, 5 Oct 2017 21:13:27 +0000 (23:13 +0200)]
ask ganeti for vg, execute zfs snap using ssh command=
Dobrica Pavlinusic [Thu, 5 Oct 2017 16:44:49 +0000 (18:44 +0200)]
use backup environment var for top-level backup dir
Dobrica Pavlinusic [Sun, 1 Oct 2017 07:03:17 +0000 (09:03 +0200)]
find instances running with closed lvs which require gnt-instance replace-disks
Dobrica Pavlinusic [Sat, 30 Sep 2017 13:33:33 +0000 (15:33 +0200)]
prefix output with machine name
Dobrica Pavlinusic [Wed, 27 Sep 2017 15:59:17 +0000 (17:59 +0200)]
fix interval skip
Dobrica Pavlinusic [Tue, 26 Sep 2017 18:45:46 +0000 (20:45 +0200)]
report power supply metrics
Dobrica Pavlinusic [Thu, 8 Jun 2017 11:24:14 +0000 (13:24 +0200)]
copy acls also to backup
Dobrica Pavlinusic [Sun, 28 May 2017 19:09:41 +0000 (21:09 +0200)]
Merge branch 'master' of https://github.com/ffzg/gnt-info
Dobrica Pavlinusic [Sun, 28 May 2017 19:07:31 +0000 (21:07 +0200)]
use snf-image to backup windows machines
root [Tue, 23 May 2017 07:31:42 +0000 (09:31 +0200)]
insert 0 for vlan if unknown
Dobrica Pavlinusic [Mon, 22 May 2017 20:43:33 +0000 (22:43 +0200)]
watch drbd and mdstat sync
Dobrica Pavlinusic [Mon, 22 May 2017 19:05:04 +0000 (21:05 +0200)]
rsync extended attrs to preserve capabilities
Dobrica Pavlinusic [Mon, 22 May 2017 19:04:08 +0000 (21:04 +0200)]
don't exit if watched job failed
Dobrica Pavlinusic [Mon, 22 May 2017 19:03:02 +0000 (21:03 +0200)]
sleep only if needed
Dobrica Pavlinusic [Thu, 20 Apr 2017 13:23:22 +0000 (15:23 +0200)]
don't send data to influx from gnt-i
Dobrica Pavlinusic [Thu, 20 Apr 2017 13:21:59 +0000 (15:21 +0200)]
prefix measurement with ganeti_
This allows us to insert data into any database (e.g. telegraf)
Also check if wait is larger than interval to fix negative wait time.
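A minimal sketch of the wait calculation, assuming timestamps in whole seconds; the function name and argument order are hypothetical, not taken from the script. When processing took longer than the interval, the remaining wait is clamped to 0 instead of going negative.

```shell
# remaining_wait INTERVAL START NOW
# Prints how many seconds are left in the current interval,
# never less than 0.
remaining_wait() {
    local left=$(( $1 - ($3 - $2) ))
    if [ "$left" -lt 0 ]; then
        left=0
    fi
    echo "$left"
}
```

A caller could then run `sleep "$(remaining_wait "$INTERVAL" "$start" "$(date +%s)")"` at the end of each loop iteration.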
Dobrica Pavlinusic [Tue, 14 Feb 2017 11:35:53 +0000 (12:35 +0100)]
divide all values by $INTERVAL so we work for intervals which are > 1 s
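As an illustration only, assuming the values are deltas between two counter readings taken one interval apart; the function name is hypothetical and integer division is used for brevity.

```shell
# rate_per_second PREVIOUS CURRENT INTERVAL
# Converts a counter delta collected over INTERVAL seconds
# into a per-second rate (integer division).
rate_per_second() {
    echo $(( ($2 - $1) / $3 ))
}
```

With a 5 s interval, `rate_per_second 1000 1600 5` gives the same per-second figure that a 1 s interval would report.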
Dobrica Pavlinusic [Mon, 13 Feb 2017 11:05:49 +0000 (12:05 +0100)]
sleep correct amount of time regardless of processing time
Dobrica Pavlinusic [Mon, 13 Feb 2017 10:36:11 +0000 (11:36 +0100)]
make timestamp integer
Dobrica Pavlinusic [Mon, 13 Feb 2017 10:34:11 +0000 (11:34 +0100)]
emit timestamp from local node
Dobrica Pavlinusic [Thu, 9 Feb 2017 13:47:48 +0000 (14:47 +0100)]
rsync --inplace helps copy-on-write filesystems
Dobrica Pavlinusic [Thu, 9 Feb 2017 13:47:22 +0000 (14:47 +0100)]
Merge branch 'master' of github.com:ffzg/gnt-info
Dobrica Pavlinušić [Thu, 9 Feb 2017 13:38:13 +0000 (14:38 +0100)]
Merge pull request #1 from mnalis/master
original code has problems: dies if no running jobs at the moment (gn…
Matija Nalis [Thu, 9 Feb 2017 12:15:58 +0000 (13:15 +0100)]
original code has problems: dies if no running jobs at the moment (gnt-job watch without args throws error), dies if more than one job running (gnt-job watch with multiple args throws error), hammers the server if no jobs are running (no sleep)
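A rough sketch of a loop body that avoids the three problems listed above, not the code from the pull request. The wrapper names, IDLE_SLEEP, and the use of `gnt-job list --running` are assumptions; check that your Ganeti version accepts those options.

```shell
# Thin wrappers around the Ganeti CLI (assumed invocation).
list_running_jobs() { gnt-job list --no-headers --running -o id; }
watch_job() { gnt-job watch "$1"; }

# One pass: watch every running job, one id per call.
watch_once() {
    local jobs job
    jobs=$(list_running_jobs)
    if [ -z "$jobs" ]; then
        # nothing running: back off instead of hammering the server
        sleep "${IDLE_SLEEP:-5}"
        return 0
    fi
    for job in $jobs; do
        # a failed job must not end the loop
        watch_job "$job" || true
    done
}
```

Run as `while true; do watch_once; done` on the master node.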
Dobrica Pavlinusic [Wed, 18 Jan 2017 10:58:15 +0000 (11:58 +0100)]
fix typo
Dobrica Pavlinusic [Sat, 24 Dec 2016 08:50:57 +0000 (09:50 +0100)]
more robust optional offset mount
Dobrica Pavlinusic [Sat, 24 Dec 2016 08:26:25 +0000 (09:26 +0100)]
sleep before mount, remove shell on finish
Dobrica Pavlinusic [Sat, 24 Dec 2016 08:24:43 +0000 (09:24 +0100)]
umount snapshots, remove mount directories
Dobrica Pavlinusic [Fri, 23 Dec 2016 18:01:56 +0000 (19:01 +0100)]
remove mount directory which functions as a lock
this allows us to find failed backups and never re-try for that host
(until reboot of instance and /dev/shm cleanup that is)
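A minimal sketch of a directory used as a lock, with hypothetical function names. `mkdir` without `-p` fails when the directory already exists, so creating it is an atomic test-and-set; removing it only after a successful backup leaves the directory behind as a marker of failure.

```shell
# acquire_lock DIR: succeeds only for the first caller.
acquire_lock() {
    mkdir "$1" 2>/dev/null
}

# release_lock DIR: call this only after the backup finished,
# so a failed run keeps the directory in place.
release_lock() {
    rmdir "$1"
}
```

Used as `acquire_lock "/dev/shm/$instance" || exit 1` before mounting, where the /dev/shm path layout is an assumption.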
Dobrica Pavlinusic [Fri, 23 Dec 2016 17:50:49 +0000 (18:50 +0100)]
remove boot flag from fdisk output
this created wrong offset... sfdisk -d output is much nicer,
but I'm not sure that its dependency is worth it
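To illustrate why the boot flag matters: the `*` is an extra column, so the start sector moves from the second field to the third. A sketch under assumptions (older fdisk column layout, 512-byte sectors, hypothetical function name):

```shell
# partition_offset LINE
# LINE is one partition row from fdisk -l output.
# Prints the byte offset of the partition start.
partition_offset() {
    # drop the boot flag so the start sector is always field 2
    set -- $(printf '%s\n' "$1" | tr -d '*')
    echo $(( $2 * 512 ))
}
```

The result can be passed to `mount -o ro,loop,offset=...`.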
Dobrica Pavlinusic [Fri, 23 Dec 2016 17:50:37 +0000 (18:50 +0100)]
read instance and disk from stdin with -
Dobrica Pavlinusic [Fri, 23 Dec 2016 17:49:32 +0000 (18:49 +0100)]
remove snapshots on all nodes in cluster
Dobrica Pavlinusic [Fri, 23 Dec 2016 17:32:13 +0000 (18:32 +0100)]
few careful shell escapes, and now it works
Dobrica Pavlinusic [Fri, 23 Dec 2016 17:20:24 +0000 (18:20 +0100)]
mount first Linux partition
Dobrica Pavlinusic [Fri, 23 Dec 2016 17:13:55 +0000 (18:13 +0100)]
specify instance and disk on command-line
Dobrica Pavlinusic [Fri, 23 Dec 2016 10:18:22 +0000 (11:18 +0100)]
mount snapshot read-only
Dobrica Pavlinusic [Fri, 23 Dec 2016 07:39:33 +0000 (08:39 +0100)]
create lv snapshot and rsync it to backup server
Dobrica Pavlinusic [Thu, 22 Dec 2016 21:49:11 +0000 (22:49 +0100)]
re-run watcher if there are more running jobs
Dobrica Pavlinusic [Wed, 21 Dec 2016 16:44:05 +0000 (17:44 +0100)]
remove hanging snapshots after gnt-backup export
Dobrica Pavlinusic [Wed, 21 Dec 2016 12:38:44 +0000 (13:38 +0100)]
backup all running instances
Dobrica Pavlinusic [Tue, 20 Dec 2016 13:30:19 +0000 (14:30 +0100)]
move backup server to lib15
Dobrica Pavlinusic [Fri, 9 Dec 2016 19:18:38 +0000 (20:18 +0100)]
make-pidfile
Dobrica Pavlinusic [Fri, 9 Dec 2016 19:16:48 +0000 (20:16 +0100)]
added cpu ticks (not divided by vcpus as cpu is)
Dobrica Pavlinusic [Fri, 9 Dec 2016 18:53:09 +0000 (19:53 +0100)]
create bug dumps on curl failure
Dobrica Pavlinusic [Fri, 9 Dec 2016 18:52:52 +0000 (19:52 +0100)]
move location generation to avoid warning
Dobrica Pavlinusic [Fri, 9 Dec 2016 18:34:45 +0000 (19:34 +0100)]
tags don't need quotes in influxdb, they are always ascii indexed in memory
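For reference, a sketch of how one InfluxDB line protocol record is laid out: `measurement,tag_set field_set timestamp`. Tag values are always strings and are written without quotes; only string field values take double quotes. The function name and the sample measurement, tags and field are made up for illustration.

```shell
# influx_line MEASUREMENT TAGS FIELDS TIMESTAMP
# TAGS and FIELDS are comma-separated key=value lists.
influx_line() {
    echo "$1,$2 $3 $4"
}
```

`influx_line cpu 'instance=web1,vlan=0' 'usage=12.5' 1481308485` yields `cpu,instance=web1,vlan=0 usage=12.5 1481308485`; a timestamp in seconds assumes the write request sets the matching precision.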
Dobrica Pavlinusic [Fri, 9 Dec 2016 18:32:58 +0000 (19:32 +0100)]
document bug and add workaround
Dobrica Pavlinusic [Fri, 9 Dec 2016 10:18:03 +0000 (11:18 +0100)]
fix vlan which can't be integer since it's tag
Dobrica Pavlinusic [Fri, 9 Dec 2016 09:41:39 +0000 (10:41 +0100)]
remove extra --background in stop which is not needed
Dobrica Pavlinusic [Fri, 9 Dec 2016 09:40:59 +0000 (10:40 +0100)]
stop/start gnt-monitor using systemd
Dobrica Pavlinusic [Thu, 8 Dec 2016 21:50:03 +0000 (22:50 +0100)]
systemd default file
Dobrica Pavlinusic [Thu, 8 Dec 2016 20:17:48 +0000 (21:17 +0100)]
rename systemd install script
Dobrica Pavlinusic [Thu, 8 Dec 2016 20:02:55 +0000 (21:02 +0100)]
sysv init now really works
Dobrica Pavlinusic [Thu, 8 Dec 2016 20:01:43 +0000 (21:01 +0100)]
include influx and debug info in pid
Dobrica Pavlinusic [Thu, 8 Dec 2016 19:01:04 +0000 (20:01 +0100)]
consistent naming, fix influx serialization
Dobrica Pavlinusic [Thu, 8 Dec 2016 17:57:29 +0000 (18:57 +0100)]
added DC and RACK location from environment
Dobrica Pavlinusic [Thu, 8 Dec 2016 17:40:25 +0000 (18:40 +0100)]
sysv install
Dobrica Pavlinusic [Thu, 8 Dec 2016 16:03:32 +0000 (17:03 +0100)]
sysv init script for gnt-monitor
Dobrica Pavlinusic [Thu, 8 Dec 2016 09:09:08 +0000 (10:09 +0100)]
reload systemd config, kill running gnt-monitors
Dobrica Pavlinusic [Thu, 8 Dec 2016 09:01:15 +0000 (10:01 +0100)]
always restart gnt-monitor
Dobrica Pavlinusic [Thu, 8 Dec 2016 08:35:06 +0000 (09:35 +0100)]
install gnt-monitor using systemd
Dobrica Pavlinusic [Wed, 7 Dec 2016 14:34:10 +0000 (15:34 +0100)]
skip non-existing or stopped instances
Dobrica Pavlinusic [Wed, 7 Dec 2016 10:56:50 +0000 (11:56 +0100)]
POST data to influxdb
Dobrica Pavlinusic [Wed, 7 Dec 2016 08:57:40 +0000 (09:57 +0100)]
added cpu usage % and show just changes
Dobrica Pavlinusic [Wed, 7 Dec 2016 07:16:30 +0000 (08:16 +0100)]
extract disk and network from running kvm instances
Dobrica Pavlinusic [Tue, 6 Dec 2016 11:34:52 +0000 (12:34 +0100)]
telegraf configuration for http endpoint at 127.0.0.1:8186
Dobrica Pavlinusic [Tue, 6 Dec 2016 11:32:45 +0000 (12:32 +0100)]
send ganeti stats to influxdb
Dobrica Pavlinusic [Mon, 5 Dec 2016 22:59:38 +0000 (23:59 +0100)]
add influx emitter using curl
Dobrica Pavlinusic [Fri, 2 Sep 2016 19:34:50 +0000 (21:34 +0200)]
use nfs
Dobrica Pavlinusic [Thu, 18 Aug 2016 09:55:20 +0000 (11:55 +0200)]
skip offline nodes
Dobrica Pavlinusic [Fri, 17 Jun 2016 19:53:04 +0000 (21:53 +0200)]
add vlan to instance
Dobrica Pavlinusic [Fri, 10 Jun 2016 13:36:15 +0000 (15:36 +0200)]
better output
Dobrica Pavlinusic [Mon, 28 Dec 2015 14:50:05 +0000 (15:50 +0100)]
watch running jobs
Dobrica Pavlinusic [Mon, 28 Dec 2015 14:48:34 +0000 (15:48 +0100)]
alias gc to gnt-cluster command
Dobrica Pavlinusic [Sat, 14 Nov 2015 20:22:27 +0000 (21:22 +0100)]
show sensors on all nodes in cluster
Dobrica Pavlinusic [Sat, 14 Nov 2015 18:24:30 +0000 (19:24 +0100)]
use different kernels
Dobrica Pavlinusic [Sun, 8 Mar 2015 21:49:18 +0000 (22:49 +0100)]
written size and compress ratio for snapshots
Dobrica Pavlinusic [Sun, 1 Mar 2015 14:21:49 +0000 (15:21 +0100)]
support more than one instance as argument
Dobrica Pavlinusic [Sun, 1 Mar 2015 12:44:04 +0000 (13:44 +0100)]
show instance info before start of export