gnt-info
6 years agocheck if we need to re-run smart tests
Dobrica Pavlinusic [Wed, 17 Jan 2018 16:40:00 +0000 (17:40 +0100)]
check if we need to re-run smart tests

6 years agobetter detect megaraid controllers
Dobrica Pavlinusic [Wed, 17 Jan 2018 10:57:07 +0000 (11:57 +0100)]
better detect megaraid controllers

6 years agoMerge branch 'master' of https://github.com/ffzg/gnt-info
Dobrica Pavlinusic [Mon, 15 Jan 2018 18:34:13 +0000 (19:34 +0100)]
Merge branch 'master' of https://github.com/ffzg/gnt-info

6 years agogenerate full report by default
Dobrica Pavlinusic [Mon, 15 Jan 2018 18:32:58 +0000 (19:32 +0100)]
generate full report by default

just add '^# 1' as arument to get quick report

6 years agoreport test remaining if self-test is running
Dobrica Pavlinusic [Mon, 15 Jan 2018 16:52:26 +0000 (17:52 +0100)]
report test remaining if self-test is running

6 years agoMerge remote-tracking branch 'lib15/lib15'
Dobrica Pavlinusic [Sun, 14 Jan 2018 13:46:11 +0000 (14:46 +0100)]
Merge remote-tracking branch 'lib15/lib15'

6 years agouse smartctl on megaraid controller only once
Dobrica Pavlinusic [Sun, 14 Jan 2018 13:41:26 +0000 (14:41 +0100)]
use smartctl on megaraid controller only once

If you export multiple vds from controller, they will all have
full set of disks attached from smartctl's point of view, so there
is no need to run this more than once

6 years agoreport smart for whole cluster
Dobrica Pavlinusic [Sun, 14 Jan 2018 13:23:40 +0000 (14:23 +0100)]
report smart for whole cluster

6 years agoexecute SMART smartctl params on each drive
Dobrica Pavlinusic [Sun, 14 Jan 2018 12:55:46 +0000 (13:55 +0100)]
execute SMART smartctl params on each drive

e.g. SMART="-t long" ./smart-megaraid.sh

6 years agosmart script is not ganeti specific
Dobrica Pavlinusic [Sun, 14 Jan 2018 12:28:42 +0000 (13:28 +0100)]
smart script is not ganeti specific

6 years agoreport smart status from megaraid controller
Dobrica Pavlinusic [Sun, 14 Jan 2018 11:51:25 +0000 (12:51 +0100)]
report smart status from megaraid controller

6 years agodisplay last selftest log for each disk
Dobrica Pavlinusic [Sun, 14 Jan 2018 10:29:02 +0000 (11:29 +0100)]
display last selftest log for each disk

6 years agoretrive rsync.args from rsync server for each instance if it exists
Dobrica Pavlinusic [Tue, 5 Dec 2017 12:08:10 +0000 (13:08 +0100)]
retrive args from rsync server for each instance if it exists

This allows us server-side --exclude 'var/lib/php5/sess_*' and similar
directives which can't really be specified in server rsyncd.conf beacause
it will produce rsync error, and not remove snapshot at end
(and prevent future backups)

6 years agodisplay vnc connection info over ssh for instance
Dobrica Pavlinusic [Tue, 7 Nov 2017 07:32:30 +0000 (08:32 +0100)]
display vnc connection info over ssh for instance

6 years agoask ganeti for vg, execute zfs snap using ssh command=
Dobrica Pavlinusic [Thu, 5 Oct 2017 21:13:27 +0000 (23:13 +0200)]
ask ganeti for vg, execute zfs snap using ssh command=

6 years agouse backup enviroment var for top-level backup dir
Dobrica Pavlinusic [Thu, 5 Oct 2017 16:44:49 +0000 (18:44 +0200)]
use backup enviroment var for top-level backup dir

6 years agofind instances running with closed lvs which require gnt-instance replace-disks
Dobrica Pavlinusic [Sun, 1 Oct 2017 07:03:17 +0000 (09:03 +0200)]
find instances running with closed lvs which require gnt-instance replace-disks

6 years agoprefix output with machine name
Dobrica Pavlinusic [Sat, 30 Sep 2017 13:33:33 +0000 (15:33 +0200)]
prefix output with machine name

6 years agofix interval skip
Dobrica Pavlinusic [Wed, 27 Sep 2017 15:59:17 +0000 (17:59 +0200)]
fix interval skip

6 years agoreport power supply metrics
Dobrica Pavlinusic [Tue, 26 Sep 2017 18:45:46 +0000 (20:45 +0200)]
report power supply metrics

6 years agocopy acls also to backup
Dobrica Pavlinusic [Thu, 8 Jun 2017 11:24:14 +0000 (13:24 +0200)]
copy acls also to backup

6 years agoMerge branch 'master' of https://github.com/ffzg/gnt-info
Dobrica Pavlinusic [Sun, 28 May 2017 19:09:41 +0000 (21:09 +0200)]
Merge branch 'master' of https://github.com/ffzg/gnt-info

6 years agouse snf-image to backup windows machines
Dobrica Pavlinusic [Sun, 28 May 2017 19:07:31 +0000 (21:07 +0200)]
use snf-image to backup windows machines

6 years agoinsert 0 for vlan if unknown
root [Tue, 23 May 2017 07:31:42 +0000 (09:31 +0200)]
insert 0 for vlan if unknown

6 years agowatch drbd and mdstat sync
Dobrica Pavlinusic [Mon, 22 May 2017 20:43:33 +0000 (22:43 +0200)]
watch drbd and mdstat sync

6 years agorsync extended attrs to preserve capabilities
Dobrica Pavlinusic [Mon, 22 May 2017 19:05:04 +0000 (21:05 +0200)]
rsync extended attrs to preserve capabilities

6 years agodon't exit if wathed job failed
Dobrica Pavlinusic [Mon, 22 May 2017 19:04:08 +0000 (21:04 +0200)]
don't exit if wathed job failed

6 years agosleep only if needed
Dobrica Pavlinusic [Mon, 22 May 2017 19:03:02 +0000 (21:03 +0200)]
sleep only if needed

7 years agodon't send data to influx from gnt-i
Dobrica Pavlinusic [Thu, 20 Apr 2017 13:23:22 +0000 (15:23 +0200)]
don't send data to influx from gnt-i

7 years agoprefix measurement with ganeti_
Dobrica Pavlinusic [Thu, 20 Apr 2017 13:21:59 +0000 (15:21 +0200)]
prefix measurement with ganeti_

This allows us to insert data into any database (e.g. telegraf)

Also check if wait if larger than inverval to fix negative time wait.

7 years agodivide all values by $INTERVAL so we work for intervals which are > 1 s
Dobrica Pavlinusic [Tue, 14 Feb 2017 11:35:53 +0000 (12:35 +0100)]
divide all values by $INTERVAL so we work for intervals which are > 1 s

7 years agosleep correct amount of time regardles of processing time
Dobrica Pavlinusic [Mon, 13 Feb 2017 11:05:49 +0000 (12:05 +0100)]
sleep correct amount of time regardles of processing time

7 years agomake timestamp integer
Dobrica Pavlinusic [Mon, 13 Feb 2017 10:36:11 +0000 (11:36 +0100)]
make timestamp integer

7 years agoemit timestamp from local node
Dobrica Pavlinusic [Mon, 13 Feb 2017 10:34:11 +0000 (11:34 +0100)]
emit timestamp from local node

7 years agorsync --inplace helps copy-on-write filesystems
Dobrica Pavlinusic [Thu, 9 Feb 2017 13:47:48 +0000 (14:47 +0100)]
rsync --inplace helps copy-on-write filesystems

7 years agoMerge branch 'master' of github.com:ffzg/gnt-info
Dobrica Pavlinusic [Thu, 9 Feb 2017 13:47:22 +0000 (14:47 +0100)]
Merge branch 'master' of github.com:ffzg/gnt-info

7 years agoMerge pull request #1 from mnalis/master
Dobrica Pavlinušić [Thu, 9 Feb 2017 13:38:13 +0000 (14:38 +0100)]
Merge pull request #1 from mnalis/master

original code has problems: dies if no running jobs at the moment (gn…

7 years agooriginal code has problems: dies if no running jobs at the moment (gnt-job watch...
Matija Nalis [Thu, 9 Feb 2017 12:15:58 +0000 (13:15 +0100)]
original code has problems: dies if no running jobs at the moment (gnt-job watch without args throws error), dies if more than one job running (gnt-job watch with multiple args throws error), hammers the server if no jobs are running (no sleep)

7 years agofix typo
Dobrica Pavlinusic [Wed, 18 Jan 2017 10:58:15 +0000 (11:58 +0100)]
fix typo

7 years agomore robust optional offset mount
Dobrica Pavlinusic [Sat, 24 Dec 2016 08:50:57 +0000 (09:50 +0100)]
more robust optional offset mount

7 years agosleep before mount, remove shell on finish
Dobrica Pavlinusic [Sat, 24 Dec 2016 08:26:25 +0000 (09:26 +0100)]
sleep before mount, remove shell on finish

7 years agoumount snapshots, remove mount directories
Dobrica Pavlinusic [Sat, 24 Dec 2016 08:24:43 +0000 (09:24 +0100)]
umount snapshots, remove mount directories

7 years agoremove mount directory which functions as a lock
Dobrica Pavlinusic [Fri, 23 Dec 2016 18:01:56 +0000 (19:01 +0100)]
remove mount directory which functions as a lock

this allows us to find failed backups and never re-try for that host
(until reboot of instance and /dev/shm cleanup that is)

7 years agoremove boot flag from fdisk output
Dobrica Pavlinusic [Fri, 23 Dec 2016 17:50:49 +0000 (18:50 +0100)]
remove boot flag from fdisk output

this created wrong offset... sfdisk -d output is much nicer,
but I'm not sure that it's dependency is worth it

7 years agoread instance and disk from stdin with -
Dobrica Pavlinusic [Fri, 23 Dec 2016 17:50:37 +0000 (18:50 +0100)]
read instance and disk from stdin with -

7 years agoremove snapshots on all nodes in cluster
Dobrica Pavlinusic [Fri, 23 Dec 2016 17:49:32 +0000 (18:49 +0100)]
remove snapshots on all nodes in cluster

7 years agofew careful shell escapes, and now it works
Dobrica Pavlinusic [Fri, 23 Dec 2016 17:32:13 +0000 (18:32 +0100)]
few careful shell escapes, and now it works

7 years agomount first Linux partition
Dobrica Pavlinusic [Fri, 23 Dec 2016 17:20:24 +0000 (18:20 +0100)]
mount first Linux partition

7 years agospecify instance and disk on command-line
Dobrica Pavlinusic [Fri, 23 Dec 2016 17:13:55 +0000 (18:13 +0100)]
specify instance and disk on command-line

7 years agomount snapshot read-only
Dobrica Pavlinusic [Fri, 23 Dec 2016 10:18:22 +0000 (11:18 +0100)]
mount snapshot read-only

7 years agocreate lv snapshot and rsync it to backup server
Dobrica Pavlinusic [Fri, 23 Dec 2016 07:39:33 +0000 (08:39 +0100)]
create lv snapshot and rsync it to backup server

7 years agore-run watcher if there are more running jobs
Dobrica Pavlinusic [Thu, 22 Dec 2016 21:49:11 +0000 (22:49 +0100)]
re-run watcher if there are more running jobs

7 years agoremove haning snapshots after gnt-backup export
Dobrica Pavlinusic [Wed, 21 Dec 2016 16:44:05 +0000 (17:44 +0100)]
remove haning snapshots after gnt-backup export

7 years agobackup all running instances
Dobrica Pavlinusic [Wed, 21 Dec 2016 12:38:44 +0000 (13:38 +0100)]
backup all running instances

7 years agomove backup server to lib15
Dobrica Pavlinusic [Tue, 20 Dec 2016 13:30:19 +0000 (14:30 +0100)]
move backup server to lib15

7 years agomake-pidfile
Dobrica Pavlinusic [Fri, 9 Dec 2016 19:18:38 +0000 (20:18 +0100)]
make-pidfile

7 years agoadded cpu ticks (not divided by vcpus as cpu is)
Dobrica Pavlinusic [Fri, 9 Dec 2016 19:16:48 +0000 (20:16 +0100)]
added cpu ticks (not divided by vcpus as cpu is)

7 years agocreate bug dumps on curl failure
Dobrica Pavlinusic [Fri, 9 Dec 2016 18:53:09 +0000 (19:53 +0100)]
create bug dumps on curl failure

7 years agomove location generation to avoid warning
Dobrica Pavlinusic [Fri, 9 Dec 2016 18:52:52 +0000 (19:52 +0100)]
move location generation to avoid warning

7 years agotags don't need quotes in influxdb, they are always ascii indexed in memory
Dobrica Pavlinusic [Fri, 9 Dec 2016 18:34:45 +0000 (19:34 +0100)]
tags don't need quotes in influxdb, they are always ascii indexed in memory

7 years agodocument bug and add workaround
Dobrica Pavlinusic [Fri, 9 Dec 2016 18:32:58 +0000 (19:32 +0100)]
document bug and add workaround

7 years agofix vlan which can't be integer since it's tag
Dobrica Pavlinusic [Fri, 9 Dec 2016 10:18:03 +0000 (11:18 +0100)]
fix vlan which can't be integer since it's tag

7 years agoremove extra --background in stop which is not needed
Dobrica Pavlinusic [Fri, 9 Dec 2016 09:41:39 +0000 (10:41 +0100)]
remove extra --background in stop which is not needed

7 years agostop/start gnt-monitor using systemd
Dobrica Pavlinusic [Fri, 9 Dec 2016 09:40:59 +0000 (10:40 +0100)]
stop/start gnt-monitor using systemd

7 years agosystemd default file
Dobrica Pavlinusic [Thu, 8 Dec 2016 21:50:03 +0000 (22:50 +0100)]
systemd default file

7 years agorename systemd install script
Dobrica Pavlinusic [Thu, 8 Dec 2016 20:17:48 +0000 (21:17 +0100)]
rename systemd install script

7 years agosysv init now really works
Dobrica Pavlinusic [Thu, 8 Dec 2016 20:02:55 +0000 (21:02 +0100)]
sysv init now really works

7 years agoinclude influx and debug info in pid
Dobrica Pavlinusic [Thu, 8 Dec 2016 20:01:43 +0000 (21:01 +0100)]
include influx and debug info in pid

7 years agoconsistent naming, fix influx serialization
Dobrica Pavlinusic [Thu, 8 Dec 2016 19:01:04 +0000 (20:01 +0100)]
consistent naming, fix influx serialization

7 years agoadded DC and RACK location from enviroment
Dobrica Pavlinusic [Thu, 8 Dec 2016 17:57:29 +0000 (18:57 +0100)]
added DC and RACK location from enviroment

7 years agosysv install
Dobrica Pavlinusic [Thu, 8 Dec 2016 17:40:25 +0000 (18:40 +0100)]
sysv install

7 years agosysv init script for gnt-monitor
Dobrica Pavlinusic [Thu, 8 Dec 2016 16:03:32 +0000 (17:03 +0100)]
sysv init script for gnt-monitor

7 years agoreload systemd config, kill running gnt-monitors
Dobrica Pavlinusic [Thu, 8 Dec 2016 09:09:08 +0000 (10:09 +0100)]
reload systemd config, kill running gnt-monitors

7 years agoalways restart gnt-monitor
Dobrica Pavlinusic [Thu, 8 Dec 2016 09:01:15 +0000 (10:01 +0100)]
always restart gnt-monitor

7 years agoinstall gnt-monitor using systemd
Dobrica Pavlinusic [Thu, 8 Dec 2016 08:35:06 +0000 (09:35 +0100)]
install gnt-monitor using systemd

7 years agoskip non-existing or stopped instances
Dobrica Pavlinusic [Wed, 7 Dec 2016 14:34:10 +0000 (15:34 +0100)]
skip non-existing or stopped instances

7 years agoPOST data to influxdb
Dobrica Pavlinusic [Wed, 7 Dec 2016 10:56:50 +0000 (11:56 +0100)]
POST data to influxdb

7 years agoadded cpu usage % and show just changes
Dobrica Pavlinusic [Wed, 7 Dec 2016 08:57:40 +0000 (09:57 +0100)]
added cpu usage % and show just changes

7 years agoextract disk and network from running kvm instances
Dobrica Pavlinusic [Wed, 7 Dec 2016 07:16:30 +0000 (08:16 +0100)]
extract disk and network from running kvm instances

7 years agotelegraf configuration for http endpoint at 127.0.0.1:8186
Dobrica Pavlinusic [Tue, 6 Dec 2016 11:34:52 +0000 (12:34 +0100)]
telegraf configuration for http endpoint at 127.0.0.1:8186

7 years agosend ganeti starts to influxdb
Dobrica Pavlinusic [Tue, 6 Dec 2016 11:32:45 +0000 (12:32 +0100)]
send ganeti starts to influxdb

7 years agoadd influx emitter using curl
Dobrica Pavlinusic [Mon, 5 Dec 2016 22:59:38 +0000 (23:59 +0100)]
add influx emitter using curl

7 years agouse nfs
Dobrica Pavlinusic [Fri, 2 Sep 2016 19:34:50 +0000 (21:34 +0200)]
use nfs

7 years agoskip offline nodes helpers
Dobrica Pavlinusic [Thu, 18 Aug 2016 09:55:20 +0000 (11:55 +0200)]
skip offline nodes

7 years agoadd vlan to instance
Dobrica Pavlinusic [Fri, 17 Jun 2016 19:53:04 +0000 (21:53 +0200)]
add vlan to instance

7 years agobetter output
Dobrica Pavlinusic [Fri, 10 Jun 2016 13:36:15 +0000 (15:36 +0200)]
better output

8 years agowatch running jobs
Dobrica Pavlinusic [Mon, 28 Dec 2015 14:50:05 +0000 (15:50 +0100)]
watch running jobs

8 years agoalias gc to gnt-cluster command
Dobrica Pavlinusic [Mon, 28 Dec 2015 14:48:34 +0000 (15:48 +0100)]
alias gc to gnt-cluster command

8 years agoshow sensors on all nodes in cluster
Dobrica Pavlinusic [Sat, 14 Nov 2015 20:22:27 +0000 (21:22 +0100)]
show sensors on all nodes in cluster

8 years agouse different kernels
Dobrica Pavlinusic [Sat, 14 Nov 2015 18:24:30 +0000 (19:24 +0100)]
use different kernels

9 years agowritten size and compress ratio for snapshots
Dobrica Pavlinusic [Sun, 8 Mar 2015 21:49:18 +0000 (22:49 +0100)]
written size and compress ratio for snapshots

9 years agosupport more than one instance as argument
Dobrica Pavlinusic [Sun, 1 Mar 2015 14:21:49 +0000 (15:21 +0100)]
support more than one instance as argument

9 years agoshow instance info before start of export
Dobrica Pavlinusic [Sun, 1 Mar 2015 12:44:04 +0000 (13:44 +0100)]
show instance info before start of export

9 years agochange instance kernel to 3.10
Dobrica Pavlinusic [Sat, 28 Feb 2015 13:08:14 +0000 (14:08 +0100)]
change instance kernel to 3.10

9 years agolist or create zfs snaphost from gnt-backup export
Dobrica Pavlinusic [Sat, 28 Feb 2015 13:01:07 +0000 (14:01 +0100)]
list or create zfs snaphost from gnt-backup export

9 years agoalways return filename from grep
Dobrica Pavlinusic [Thu, 29 Jan 2015 12:21:06 +0000 (13:21 +0100)]
always return filename from grep

9 years agocleanup dump of mac instance mappings to STDERR
Dobrica Pavlinusic [Wed, 28 Jan 2015 22:33:58 +0000 (23:33 +0100)]
cleanup dump of mac instance mappings to STDERR

9 years agoadded network rx tx for each instance
Dobrica Pavlinusic [Wed, 28 Jan 2015 22:18:31 +0000 (23:18 +0100)]
added network rx tx for each instance

9 years agofind tap device for instance on given node
Dobrica Pavlinusic [Wed, 28 Jan 2015 21:23:11 +0000 (22:23 +0100)]
find tap device for instance on given node

9 years agoadded node column
Dobrica Pavlinusic [Wed, 28 Jan 2015 19:34:53 +0000 (20:34 +0100)]
added node column