Skip to content

ShanechiLab/PSID

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

(I)PSID: (Input) Preferential subspace identification
[MATLAB implementation]

For Python implementation see http://github.com/ShanechiLab/PyPSID

Given signals y_t (e.g. neural signals) and z_t (e.g behavior), PSID learns a dynamic model for y_t while prioritizing the dynamics that are relevant to z_t.

IPSID is an extension of PSID that also supports taking a third signal u_t (e.g., task instructions) that is simultaneously measured with y_t. In the learned dynamical model, u_t plays the role of input to the latet states.

Publications

PSID

For the derivation of PSID and results in real neural data see the paper below.

Omid G. Sani, Hamidreza Abbaspourazad, Yan T. Wong, Bijan Pesaran, Maryam M. Shanechi. Modeling behaviorally relevant neural dynamics enabled by preferential subspace identification. Nature Neuroscience, 24, 140–149 (2021). https://doi.org/10.1038/s41593-020-00733-0

View-only full-text link: https://rdcu.be/b993t

Original preprint: https://doi.org/10.1101/808154

You can also find a summary of the paper in the following Twitter thread: https://twitter.com/MaryamShanechi/status/1325835609345122304

IPSID

For the derivation of IPSID and results in real neural data see the paper below.

Parsa Vahidi*, Omid G. Sani*, Maryam M. Shanechi. Modeling and dissociation of intrinsic and input-driven neural population dynamics underlying behavior. PNAS (2024). https://doi.org/10.1073/pnas.2212887121

Usage guide

Initialization

Add the source directory and its subdirectories to the path. You can run init.m to do this.

Main learning function

The main functions for the MATLAB implementation are the following:

A complete usage guide is available in the function. The following shows an example case:

idSys = PSID(y, z, nx, n1, i);
# Or, if modeling effect of input u is also of interest
idSys = IPSID(y, z, u, nx, n1, i);

Inputs:

  • y and z are (time x dimension) matrices with neural (e.g. LFP signal powers or spike counts) and behavioral data (e.g. joint angles, hand position, etc), respectively.
  • IPSID also takes u as an input, which is a (dimension x time) matrix, containing the measured input data.
  • Note that IPSID currently expects data to be provided as (dimension x time) while PSID supports both default (time x dimension) and (dimension x time)
  • nx is the total number of latent states to be identified.
  • n1 is the number of states that are going to be dedicated to behaviorally relevant dynamics.
  • i is the subspace horizon used for modeling.

Output:

  • idSys: a structure containing all model parameters (A, Cy, Cz, etc). For a full list see the code.

Extracting latent states using learned model

Once a model is learned using (I)PSID, you can apply the model to new data (i.e. run the associated Kalman filter) as follows:

[zPred, yPred, xPred] = PSIDPredict(idSys, y);
# Or, for IPSID:
[zPred, yPred, xPred] = PSIDPredict(idSys, y, u);

Input:

  • y: neural activity time series (time x dimension) for PSID, or (dimension x time) for IPSID
  • [For IPSID] u: input time series (dimension x time) Outputs:
  • zPred: one-step ahead prediction of behavior (if any)
  • yPred: one-step ahead prediction of neural activity
  • xPred: Extracted latent state

Required preprocessing

  • Repeated data dimensions (e.g., two identical neurons) can cause issues for the learning. Remove repeated data dimensions as a preprocessing and repeat predictions as needed to reproduce prediction of repeated data dimensions.
  • A required preprocessing when using (I)PSID is to remove the mean of neural/behavior/input signals and if needed, add them back to predictions after learning the model. Starting from version 1.1.0, Python and MATLAB PSID libraries automatically do this by default so that users won't need to worry about it. Please update to the latest version if you are using an older version.

Choosing the hyperparameters

How to pick the state dimensions nx and n1?

nx determines the total dimension of the latent state and n1 determines how many of those dimensions will be prioritizing the inclusion of behaviorally relevant neural dynamics (i.e. will be extracted using stage 1 of (I)PSID). So the values that you would select for these hyperparameters depend on the goal of modeling and on the data. Some examples use cases are:

If you want to perform dimension reduction, nx will be your desired target dimension. For example, to reduce dimension to 2 to plot low-dimensional visualizations of neural activity, you would use nx=2. Now if you want to reduce dimension while preserving as much behaviorally relevant neural dynamics as possible, you would use n1=nx. If you want to find the best fit to data overall, you can perform a grid search over values of nx and n1 and pick the value that achieves the best performance metric in the training data. For example, you could pick the nx and n1 pair that achieves the best cross-validated behavior decoding in an inner-cross-validation within the training data.

How to pick the horizon i?

The horizon i does not affect the model structure and only affects the intermediate linear algebra operations that (I)PSID performs during the learning of the model. Nevertheless, different values of i may have different model learning performance. i needs to be at least 2, but also also determines the maximum n1 and nx that can be used per:

n1 <= nz * i
nx <= ny * i

So if you have a low dimensional y_k or z_k (small ny or nz), you typically would choose larger values for i, and vice versa. It is also possible to select the best performing i via an inner cross-validation approach similar to nx and n1 above. Overall, since i affects the learning performance, it is important for reproducibility that the i that was used is reported.

For more information, see the notebook(s) referenced in the next section.

Usage examples

PSID

Example simulated data and the script for running PSID on the data is provided in example/example.m This script performs PSID model identification and visualizes the learned eigenvalues similar to in Supplementary Fig 1 in (Sani et al 2021)

The following notebook also contains some examples with the Python implementation: https://github.com/ShanechiLab/PyPSID/blob/main/source/PSID/example/PSID_tutorial.ipynb

IPSID

Example simulated data and the script for running PSID on the data is provided in example/IPSID_example.m This script performs IPSID model identification and visualizes the learned eigenvalues similar to in Fig. 2A in (Vahidi, Sani, et al, 2024).

The following notebook also contains some examples with the Python implementation: https://github.com/ShanechiLab/PyPSID/blob/main/source/PSID/example/IPSID_tutorial.ipynb

Change Log

You can see the change log in in ChangeLog.md

Licence

Copyright (c) 2020 University of Southern California
See full notice in LICENSE.md
Omid G. Sani, Parsa Vahidi and Maryam M. Shanechi
Shanechi Lab, University of Southern California