Load the local file if available, otherwise download it first.
:param as_frame: Whether to return the data as a pandas DataFrame or not.
:type as_frame: bool, default=False
Load the local file and raise error if it is not available.
:param as_frame: Whether to return the data as a pandas DataFrame or not.
:type as_frame: bool, default=False
Load the local files if available, otherwise download them first.
:param as_frame: Whether to return the data as a pandas DataFrames or numpy arrays.
:type as_frame: bool, default=False
Returns:
X, y – Where Data is either pandas.DataFrame or numpy.ndarray.
A tuple with the list of feature matrices for each axis and their
corresponding interaction matrix.
Load the local files and raise error if one is not available.
:param as_frame: Whether to return the data as a pandas DataFrames or numpy arrays.
:type as_frame: bool, default=False
Returns:
X, y – Where Data is either pandas.DataFrame or numpy.ndarray.
A tuple with the list of feature matrices for each axis and their
corresponding interaction matrix.
Binary interaction prediction between enzymes and drug molecules.
This is one of four gold-standand datasets for drug-protein interaction
prediction introduced by Yamanishi et al., 2008 [1]_.
The input features are similarity matrices among the instances on each
axis and the target matrix is a binary interaction matrix determining the
the existence of an experimentally validated interaction (value 1) or the
absence of information about an interaction (value 0).
The score of a Smith-Waterman pairwise alignment is taken as the the
similarity between proteins, whereas the SIMCOMP score is used for the
similarity between drug molecules.
Minoru Kanehisa, Prediction of drug–target interaction networks from the
integration of chemical and genomic spaces, Bioinformatics, Volume 24, Issue
13, July 2008, Pages i232–i240,
https://doi.org/10.1093/bioinformatics/btn162
Binary interactions between G-protein coupled receptors and drug molecules.
This is one of four gold-standand datasets for drug-protein interaction
prediction introduced by Yamanishi et al., 2008 [1]_.
The input features are similarity matrices among the instances on each
axis and the target matrix is a binary interaction matrix determining the
the existence of an experimentally validated interaction (value 1) or the
absence of information about an interaction (value 0).
The score of a Smith-Waterman pairwise alignment is taken as the the
similarity between proteins, whereas the SIMCOMP score is used for the
similarity between drug molecules.
Minoru Kanehisa, Prediction of drug–target interaction networks from the
integration of chemical and genomic spaces, Bioinformatics, Volume 24, Issue
13, July 2008, Pages i232–i240,
https://doi.org/10.1093/bioinformatics/btn162
Binary interactions between ion channels and drug molecules.
This is one of four gold-standand datasets for drug-protein interaction
prediction introduced by Yamanishi et al., 2008 [1]_.
The input features are similarity matrices among the instances on each
axis and the target matrix is a binary interaction matrix determining the
the existence of an experimentally validated interaction (value 1) or the
absence of information about an interaction (value 0).
The score of a Smith-Waterman pairwise alignment is taken as the the
similarity between proteins, whereas the SIMCOMP score is used for the
similarity between drug molecules.
Minoru Kanehisa, Prediction of drug–target interaction networks from the
integration of chemical and genomic spaces, Bioinformatics, Volume 24, Issue
13, July 2008, Pages i232–i240,
https://doi.org/10.1093/bioinformatics/btn162
Binary interactions between nuclear receptors and drug molecules.
This is one of four gold-standand datasets for drug-protein interaction
prediction introduced by Yamanishi et al., 2008 [1]_.
The input features are similarity matrices among the instances on each
axis and the target matrix is a binary interaction matrix determining the
the existence of an experimentally validated interaction (value 1) or the
absence of information about an interaction (value 0).
The score of a Smith-Waterman pairwise alignment is taken as the the
similarity between proteins, whereas the SIMCOMP score is used for the
similarity between drug molecules.
Minoru Kanehisa, Prediction of drug–target interaction networks from the
integration of chemical and genomic spaces, Bioinformatics, Volume 24, Issue
13, July 2008, Pages i232–i240,
https://doi.org/10.1093/bioinformatics/btn162
Return the path of the bipartite_learn data directory. This folder is
used by some large dataset loaders to avoid downloading the data several
times. By default the data directory is set to a folder named
‘bipartite_learn_data’ in the user home folder. Alternatively, it can be
set by the ‘BIPARTITE_LEARN_DATA’ environment variable or programmatically
by giving an explicit folder path. The ‘~’ symbol is expanded to the user
home folder. If the folder does not already exist, it is automatically
created.
:param data_home: The path to bipartite_learn data directory. If None, the default path
is ~/bipartite_learn_data.
Returns:
data_home – The path to bipartite_learn data directory.