
Remote restore not working - "DB metadata not changed. database may already exist" #9593

Closed
pragmaticivan opened this issue Mar 16, 2018 · 15 comments

@pragmaticivan

pragmaticivan commented Mar 16, 2018

InfluxDB 1.5 (Alpine Docker version)

**Backup**

```shell
influxd backup -portable -database "metrics-cadvisor" -host $DATABASE_HOST:$DATABASE_PORT "$BACKUP_PATH"
influxd backup -portable -database "metrics-health" -host $DATABASE_HOST:$DATABASE_PORT "$BACKUP_PATH"
```

**Restore**

```shell
influxd restore -portable -database "metrics-cadvisor" -host $DATABASE_HOST:$DATABASE_PORT "$BACKUP_PATH"
influxd restore -portable -database "metrics-health" -host $DATABASE_HOST:$DATABASE_PORT "$BACKUP_PATH"
```

**Error**

```
2018/03/16 17:18:44 error updating meta: DB metadata not changed. database may already exist
restore: DB metadata not changed. database may already exist
Restore failed
```

I tried deleting the 2 databases before restoring, and it didn't work.

@dgnorton
Contributor

@pragmaticivan what version of InfluxDB are you running?

@pragmaticivan
Author

Just updated to InfluxDB 1.5 (Alpine Docker version).

@aanthony1243
Contributor

When restoring, the flag for database selection is `-db`, not `-database`, as in:

```shell
influxd restore -portable -db "metrics-health" -host $DATABASE_HOST:$DATABASE_PORT "$BACKUP_PATH"
```

We'll propose improvements to the flags and track them in issue #9608.

@entone

entone commented Apr 11, 2018

I'm still getting this error regardless of the `-db` flag. I have also tried `-newdb`, and no db flag at all. Currently running 1.5.1.

**Error**

```
2018/04/11 09:56:20 error updating meta: DB metadata not changed. database may already exist
restore: DB metadata not changed. database may already exist
```

**Backup Command**

```shell
influxd backup -portable -database "brood" -host localhost:8088 ~/Desktop/influx-backup-test-2
```

**Restore Command**

```shell
influxd restore -portable -db "brood" -host localhost:8088 ~/Desktop/influx-backup-test-2
```

**Restore -newdb**

```shell
influxd restore -portable -newdb "brood1" -host localhost:8088 ~/Desktop/influx-backup-test-2
```

@entone

entone commented Apr 11, 2018

The legacy backup and restore appear to work fine.

@aanthony1243
Contributor

Hi @entone, try:

```shell
influxd restore -portable -db "brood" -newdb "brood1" -host localhost:8088 ~/Desktop/influx-backup-test-2
```

To clarify:

- `-db` identifies the database in the backup file that you want to restore.
- `-newdb` specifies the name to give the imported database. If not given, it defaults to the original name. However, you must restore to a database name that does not already exist in the system. If the original database already exists, the restore will fail, which is why you need both `-db` and `-newdb` in this case.

@bmailhe

bmailhe commented Apr 12, 2018

Hi @aanthony1243,

> If the original db already exists in the system, then the restore will fail, which is why you need both -db and -newdb in this case.

Then can you explain how to restore an incremental backup? E.g. every week I make a full backup, and every day an incremental one. If I have to restore a full backup plus some increments, how do I do it?

@aanthony1243
Contributor

Your commands above are not incremental; they always take a full backup. You can use either `-since` or `-start` and `-end` to extract only the more recent data (see https://docs.influxdata.com/influxdb/v1.5/administration/backup_and_restore/). Then, for each incremental backup, you do as above and restore the incremental part to brood1. Once the incremental data is in brood1, you can side-load it into the original database and drop brood1:

```
> USE brood1
> SELECT * INTO brood..:MEASUREMENT FROM /.*/ GROUP BY *
> DROP DATABASE brood1
```

You can then repeat this process for each increment.
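The repeated restore-then-sideload process can be sketched as a small shell script. This is a hypothetical sketch, not an official tool: the backup paths, `TARGET_DB`, `HOST`, and the `DRY_RUN` switch are assumptions, while the `influxd restore` and `influx -execute` invocations use the flags discussed above.

```shell
# Sketch (untested against a live server): restore each incremental backup
# directory into a temporary database, side-load it into TARGET_DB with
# SELECT * INTO, then drop the temporary database. TARGET_DB, HOST, and
# the backup paths are assumptions; set DRY_RUN=echo to preview the
# commands instead of running them.
DRY_RUN=${DRY_RUN:-}
TARGET_DB=${TARGET_DB:-brood}
TMP_DB="${TARGET_DB}_restore_tmp"
HOST=${HOST:-localhost:8088}

restore_increment() {
    backup_dir=$1
    $DRY_RUN influxd restore -portable -db "$TARGET_DB" -newdb "$TMP_DB" \
        -host "$HOST" "$backup_dir"
    $DRY_RUN influx -database "$TMP_DB" \
        -execute "SELECT * INTO \"$TARGET_DB\"..:MEASUREMENT FROM /.*/ GROUP BY *"
    $DRY_RUN influx -execute "DROP DATABASE \"$TMP_DB\""
}

# Process each backup directory given on the command line, oldest first.
for backup_dir in "$@"; do
    restore_increment "$backup_dir"
done
```

Run it after restoring the weekly full backup, passing the incremental backup directories oldest first.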

@mcappadonna

Sorry @aanthony1243, does this mean there isn't any way to restore incremental data without proceeding database by database?

@aanthony1243
Contributor

@mcappadonna yes, that's correct.

@petetnt

petetnt commented May 17, 2018

I don't want to complain about open source software, and I greatly appreciate the work that you are doing, but is incremental restore planned? Incremental backups that can only be restored through intensive manual work seem rather cumbersome, and this feels like a huge oversight.

@petetnt

petetnt commented Jun 13, 2018

We wrote this small script for restoring incremental backups so we won't be SOL when the worst happens:

https://github.com/motleyagency/influxdb-incremental-restore

It's written in Node, so it might not be optimal for every case but has worked just fine for us.

@kaiterramike

kaiterramike commented Aug 29, 2018

Also, to add onto @petetnt's great work: before you do the `SELECT * INTO` operation suggested by @aanthony1243, you'll want to:

  1. disable `query-timeout`
  2. set some sort of CPU limit on InfluxDB so it doesn't render the machine completely unusable during the restore
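For step 1, a sketch of the relevant setting, assuming the stock InfluxDB 1.x `influxdb.conf` layout (a value of `"0s"` disables the per-query timeout):

```toml
[coordinator]
  # "0s" disables the per-query timeout entirely
  query-timeout = "0s"
```

For step 2, one option on systemd hosts is a `CPUQuota=` drop-in for the influxdb service; treat both of these as assumptions to verify against your own configuration.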

I just tried an incremental restore of about 1.6M data points on a 2 core machine, and it took about 2 minutes. This operation is slower than the initial /usr/bin/influxd restore command by one or two orders of magnitude, which makes sense.

Edit: We're solidly in the "low" category on this page -- 18k series with appropriately sized hardware -- and doing this SELECT * INTO query on a single day's incremental backup takes down the machine by eating all the RAM. I'm afraid I'm going to have to invest a couple more days and write some custom code to stream rows between databases in a "nice" way -- chunking by series and by shard. I've looked into Kapacitor and export/import (data becomes too large on disk) and neither of them solve the problem.

Edit2: I found a way to merge everything from one database into another: pipe the output of an export command directly to an import command. Uses an extra ~500MB of RAM on my machine.

```shell
DB_FROM=r_air
DB_TO=r_air2
fifo_name=fifo-${DB_FROM}-to-${DB_TO}
mkfifo "$fifo_name"
influx_inspect export -datadir /var/lib/influxdb/data -waldir /var/lib/influxdb/wal \
    -database "$DB_FROM" -out "$fifo_name" &
cat "$fifo_name" \
    | sed -e "s/^CREATE DATABASE ${DB_FROM} WITH/CREATE DATABASE ${DB_TO} WITH/" \
    | sed -e "s/^# CONTEXT-DATABASE:${DB_FROM}\$/# CONTEXT-DATABASE:${DB_TO}/" \
    | influx -import -path /dev/stdin
rm "$fifo_name"
```

Note that you can't use `-out /dev/stdout` on the `influx_inspect` command, because it already writes other output to stdout, which messes up `influx -import`. A named pipe is required.

The only downside: it's extremely slow. The number of lines to process is somewhere around 3/4 the number of bytes in the gzipped backup, so a 1GB backup will require processing about 750M lines, and at 100k lines/sec on a modest machine, that's about 2 hours.
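As a sanity check before running the full pipeline, the two `sed` substitutions can be exercised on a synthetic export header. This is a minimal sketch: the sample lines are invented stand-ins for an `influx_inspect export` header, and no live database is touched.

```shell
DB_FROM=r_air
DB_TO=r_air2

# Invented stand-in for the first lines of an export, so the
# substitutions can be verified without running a real export.
sample='CREATE DATABASE r_air WITH DURATION 168h0m0s
# CONTEXT-DATABASE:r_air
cpu,host=a value=1 1520000000000000000'

# Apply the same two rewrites used in the pipeline above.
rewritten=$(printf '%s\n' "$sample" \
    | sed -e "s/^CREATE DATABASE ${DB_FROM} WITH/CREATE DATABASE ${DB_TO} WITH/" \
    | sed -e "s/^# CONTEXT-DATABASE:${DB_FROM}\$/# CONTEXT-DATABASE:${DB_TO}/")

printf '%s\n' "$rewritten"
```

Only the `CREATE DATABASE` and `# CONTEXT-DATABASE:` lines should change; the data points pass through untouched.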

@lafrech

lafrech commented Dec 17, 2020

Currently struggling with restore/backup, I'm also surprised that I can't restore a backup over an existing database. I expected new data to be added and existing data to be overwritten.

I thought this was what the docs meant by:

> If duplicate data points are included in the backup files, the points will be written again, overwriting any existing data.

Perhaps the docs refer to duplicate data points within the backup itself (although I wonder how that could happen in the first place).

Please consider this a feature request, then. The lack of an ability to restore a partial DB backup into an existing database is an important shortcoming.

Use case: I recently backed up / restored databases to migrate an InfluxDB instance and something went wrong, I guess, because new databases lack chunks of data. After a few days, I made new backups and would like to complete the new databases. I can't afford to replace from scratch because new databases have been recording new data since the migration.


I've been trying the workaround proposed by @aanthony1243 in #9593 (comment):

```
> USE brood1
> SELECT * INTO brood..:MEASUREMENT FROM /.*/ GROUP BY *
> DROP DATABASE brood1
```

Unfortunately, it seems only measurements of numerical type are copied (#18132) so it does not provide a full backup/restore solution.

At this point, I have data in two different databases and I can't find a way to merge data from one into another.

@kaiterramike

@lafrech the InfluxDB team has moved on to version 2.0, so you won't see a feature like this added to 1.x.

Consider using the hack I mentioned a few comments ago to merge data from one database to another.
