BitMEXDB

Bitmex Bulk Tick Data Crawler

This program downloads tick data from public.bitmex.com

options

timeframes

desired tick-resampling timeframes. use pandas's representation such as "1D", "1H", "1T"

db_path

desired location for the db

start_date

db crawling start date. if the db exists, it automatically detects it. if not, defaults to 20141122; not recommended to change

reset_db

set True if you want to reset(erase) db and make a new one

chromedriver_loc

your selenium chrome driver location

usage

python main.py [options]
-> the public.bitmex.com shows up.
-> if you see the list of csv.gz files on the webpage, click "yes"
-> the download starts

query

the table names are formatted as: SYMBOL_TIMEFRAME
Available TIMEFRAME are TICK, and what you have specified in option --timeframes
Available SYMBOL are in table TICKERS

def load_bitmex_data(db_path, timeframe, symbol):
    db = sqlite3.connect(db_path)

    if timeframe == "TICK":
        columns = ["timestamp", "symbol", "side", "size", "price", "tickDirection", "trdMatchID", "grossValue", "homeNotional", "foreignNotional"]
    else:
        columns = ["timestamp", "open", "high", "low", "close", "volume", "lowFirst"]
        
    df = pd.DataFrame(db.execute(f"SELECT * FROM {symbol}_{timeframe}"), columns=columns)
    df.index = pd.to_datetime(df["timestamp"])

    return df

df = load_bitmex_data("/home/ych/Storage/bitmex.db", "1T", "XBTUSD") # loads XBTUSD_1T table which has 1min candlesticks of XBTUSD

tips

when initializing the DB, it can take long (like 10~15 hours)
due to crawling restrictions, the download gets slower and slower. it's normal, but it can be faster if you just pause (shut down) and resume (rerun) the script within 30min~1 hrs.
using the function load_bitmex_data, you can easliy query the desired dataset.
please enjoy

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
LICENSE		LICENSE
README.md		README.md
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BitMEXDB

Bitmex Bulk Tick Data Crawler

options

timeframes

db_path

start_date

reset_db

chromedriver_loc

usage

query

tips

About

Releases

Packages

Languages

License

Yeachan-Heo/BitMEXDB

Folders and files

Latest commit

History

Repository files navigation

BitMEXDB

Bitmex Bulk Tick Data Crawler

options

timeframes

db_path

start_date

reset_db

chromedriver_loc

usage

query

tips

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages