bipartite_learn package#

Submodules#

bipartite_learn.base module#

class bipartite_learn.base.BaseBipartiteEstimator#: Bases: BaseMultipartiteEstimator

class bipartite_learn.base.BaseMultipartiteEstimator#

Bases: BaseEstimator

Base class for multipartite estimators.

score(X, y, sample_weight=None)#

class bipartite_learn.base.BaseMultipartiteSampler#

Bases: MultipartiteSamplerMixin

Base class for multipartite samplers.

fit_resample(X, y)#

Resample the dataset.

Parameters:

X ({array-like, dataframe, sparse matrix} of shape (n_samples, n_features)) – Matrix containing the data which have to be sampled.
y (array-like of shape (n_samples,)) – Corresponding label for each sample in X.

Returns:

X_resampled ({array-like, dataframe, sparse matrix} of shape (n_samples_new, n_features)) – The array containing the resampled data.
y_resampled (array-like of shape (n_samples_new,)) – The corresponding label of X_resampled.

sampling_strategy = 'auto'#

class bipartite_learn.base.MultipartiteSamplerMixin#: Bases: BaseMultipartiteEstimator, SamplerMixin

class bipartite_learn.base.MultipartiteTransformerMixin#

Bases: TransformerMixin

Mixin for multipartite transformers.

bipartite_learn.melter module#

class bipartite_learn.melter.BipartiteMelter#

Bases: BaseMultipartiteSampler, BaseBipartiteEstimator

Convert a bipartite dataset to a simpler global-single output format.

Convert a bipartite interaction problem, where X holds two feature matrices (one for each axis) and y is an interaction matrix, to the usual single-table format in which each sample is a combination of one sample from X[0] and one sample from X[1].

Slightly faster than MultipartiteMelter.

bipartite_learn.melter.melt_multipartite_dataset(X, y=None)#

Melt bipartite input.

If X is a list of Xi feature matrices, one for each bipartite group, convert it to traditional data format by generating concatenations of rows from X[0] with rows from X[1].
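The melting step described above can be sketched in plain NumPy. This is only an illustration under stated assumptions, not the library's implementation: the helper name melt_bipartite is hypothetical, and the row-major ordering of the output (row samples varying slowest) is assumed rather than documented here.

```python
import numpy as np

def melt_bipartite(X, y=None):
    """Hypothetical stand-in showing the idea of melting bipartite input."""
    X_rows, X_cols = (np.asarray(Xi) for Xi in X)
    n_rows, n_cols = X_rows.shape[0], X_cols.shape[0]
    # Repeat each row sample n_cols times and tile the column samples
    # n_rows times, so every (row sample, column sample) pair appears once.
    melted_X = np.hstack([
        np.repeat(X_rows, n_cols, axis=0),
        np.tile(X_cols, (n_rows, 1)),
    ])
    if y is None:
        return melted_X
    # Row-major flattening keeps y[i, j] aligned with melted row i * n_cols + j.
    return melted_X, np.asarray(y).reshape(-1)

X = [np.array([[1, 2], [3, 4]]), np.array([[10], [20], [30]])]
y = np.array([[0, 1, 0], [1, 0, 1]])
melted_X, melted_y = melt_bipartite(X, y)
print(melted_X.shape, melted_y.shape)
```

With 2 row samples and 3 column samples, the melted dataset has 6 samples, each carrying the 2 row features followed by the 1 column feature.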

bipartite_learn.melter.row_cartesian_product(X)#

Row cartesian product of 2D arrays in X.

Pick one row from each of the 2D arrays in X, in their presented order, and concatenate them. Repeat. Return a 2D array whose rows are all possible combinations of rows in X.

Parameters:: X (list-like of 2D np.ndarrays) – The arrays whose rows are combined.
Returns:: result – Cartesian product of X’s 2d arrays, row-wise.
Return type:: 2D np.ndarray
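A minimal sketch of a row-wise cartesian product for any number of arrays, using itertools.product. The function name row_cartesian_product_sketch is hypothetical, and the output ordering (last array varying fastest) is an assumption; the library's own ordering is not stated on this page.

```python
from itertools import product

import numpy as np

def row_cartesian_product_sketch(X):
    """Illustrative only: concatenate one row from each array, for all combinations."""
    arrays = [np.asarray(Xi) for Xi in X]
    # Iterating a 2D array yields its rows; product() enumerates every
    # combination of one row per array, with the last array varying fastest.
    return np.array([np.concatenate(rows) for rows in product(*arrays)])

A = np.array([[1, 2], [3, 4]])
B = np.array([[5], [6]])
C = np.array([[7, 8]])
result = row_cartesian_product_sketch([A, B, C])
print(result)
```

The result has one row per combination (2 × 2 × 1 = 4 rows here) and as many columns as the arrays have in total (2 + 1 + 2 = 5).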

bipartite_learn.neighbors module#

Distance Weighted Neighbors Regression.

class bipartite_learn.neighbors.WeightedNeighborsRegressor(*, weights='distance', p=2, metric='minkowski', metric_params=None, n_jobs=None)#

Bases: KNeighborsMixin, RegressorMixin, NeighborsBase

fit(X, y)#

Fit the k-nearest neighbors regressor from the training dataset.

Parameters:

X ({array-like, sparse matrix} of shape (n_samples, n_features) or (n_samples, n_samples) if metric='precomputed') – Training data.
y ({array-like, sparse matrix} of shape (n_samples,) or (n_samples, n_outputs)) – Target values.

Returns:: self – The fitted k-nearest neighbors regressor.
Return type:: KNeighborsRegressor

predict(X)#

Predict the target for the provided data.

Parameters:

X ({array-like, sparse matrix} of shape (n_queries, n_features), or (n_queries, n_indexed) if metric == 'precomputed') – Test samples.

Returns:: y – Target values.
Return type:: ndarray of shape (n_queries,) or (n_queries, n_outputs), dtype=int
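The constructor above takes no n_neighbors argument and defaults to weights='distance', which suggests that every training sample contributes to each prediction with a weight that decreases with distance. That reading is an assumption, not something this page confirms. Under it, the prediction rule could be sketched in NumPy as follows; distance_weighted_predict is a hypothetical helper, not part of the library.

```python
import numpy as np

def distance_weighted_predict(X_train, y_train, X_query, p=2):
    """Hypothetical helper: inverse-distance-weighted average of all targets."""
    X_train = np.asarray(X_train, dtype=float)
    y_train = np.asarray(y_train, dtype=float)
    X_query = np.asarray(X_query, dtype=float)
    # Pairwise Minkowski distances, shape (n_queries, n_train).
    diff = np.abs(X_query[:, None, :] - X_train[None, :, :])
    dist = (diff ** p).sum(axis=2) ** (1.0 / p)
    predictions = []
    for d in dist:
        exact = d == 0
        if exact.any():
            # A query that coincides with training points uses only those points.
            weights = exact.astype(float)
        else:
            weights = 1.0 / d
        predictions.append(np.average(y_train, axis=0, weights=weights))
    return np.array(predictions)

X_train = [[0.0], [1.0], [3.0]]
y_train = [0.0, 1.0, 3.0]
preds = distance_weighted_predict(X_train, y_train, [[1.0], [2.0]])
print(preds)
```

The first query matches a training point exactly, so it returns that point's target. The second is at distances 2, 1 and 1 from the training points, giving weights 0.5, 1 and 1.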

bipartite_learn.pipeline module#

bipartite_learn.pipeline.make_multipartite_pipeline(*steps, ndim=2, memory=None, verbose=False)#

Utility function to create pipelines for multipartite data.

It wraps monopartite transformers with MultipartiteTransformerWrapper.

bipartite_learn.wrappers module#

Set of tools to apply standard estimators to bipartite datasets.

TODO: Docs. TODO: check fit inputs.

class bipartite_learn.wrappers.GlobalSingleOutputWrapper(estimator: BaseEstimator, under_sampler: BaseSampler | None = None)#

Bases: BaseMultipartiteEstimator, MetaEstimatorMixin

Employ the global single-output (GSO) strategy to adapt standard estimators to bipartite data.

In this strategy, the estimator is applied to concatenations of a feature vector from the first sample domain with a feature vector from the second domain, while y is considered a unidimensional vector.

See also

GlobalSingleOutputWrapper: A wrapper that fits a single-output estimator to bipartite datasets.
MultiOutputRegressor: A scikit-learn wrapper that fits a separate regressor for each output variable.
MultiOutputClassifier: A scikit-learn wrapper that fits a separate classifier for each output variable.

Examples

from bipartite_learn.datasets import NuclearReceptorsLoader
from bipartite_learn.wrappers import LocalMultiOutputWrapper
from sklearn.svm import SVC
from sklearn.neighbors import KNeighborsClassifier
from sklearn.multioutput import MultiOutputClassifier

X, y = NuclearReceptorsLoader().load()  # X is a list of two matrices
bipartite_clf = LocalMultiOutputWrapper(
    primary_rows_estimator=MultiOutputClassifier(SVC()),
    primary_cols_estimator=MultiOutputClassifier(SVC()),
    secondary_rows_estimator=KNeighborsClassifier(),
    secondary_cols_estimator=KNeighborsClassifier(),
)
bipartite_clf.fit(X, y)
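The GSO strategy described for GlobalSingleOutputWrapper can also be sketched without the library: melt the two feature matrices into one table, flatten y, fit an ordinary estimator, and reshape the predictions back into an interaction matrix. Everything below is illustrative. The melt helper and the LeastSquares class are hypothetical stand-ins (the latter for any standard estimator with fit and predict), and the row-major ordering is an assumption.

```python
import numpy as np

def melt(X):
    """Hypothetical helper: concatenate every row of X[0] with every row of X[1]."""
    X_rows, X_cols = X
    return np.hstack([
        np.repeat(X_rows, len(X_cols), axis=0),
        np.tile(X_cols, (len(X_rows), 1)),
    ])

class LeastSquares:
    """Stand-in for any standard estimator exposing fit and predict."""

    def fit(self, X, y):
        X1 = np.hstack([X, np.ones((len(X), 1))])  # append an intercept column
        self.coef_, *_ = np.linalg.lstsq(X1, y, rcond=None)
        return self

    def predict(self, X):
        return np.hstack([X, np.ones((len(X), 1))]) @ self.coef_

rng = np.random.default_rng(0)
X = [rng.normal(size=(4, 3)), rng.normal(size=(5, 2))]
# Toy interaction matrix that is additive in both domains,
# so a linear model on the concatenated features can fit it.
y = X[0].sum(axis=1)[:, None] + X[1].sum(axis=1)[None, :]

# GSO: one estimator, trained on melted features and a one-dimensional y.
model = LeastSquares().fit(melt(X), y.reshape(-1))
y_pred = model.predict(melt(X)).reshape(y.shape)
print(y_pred.shape)
```

Reshaping the flat predictions with y.shape recovers one predicted value per (row sample, column sample) pair.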

property classes_#: The classes labels. Only exist if the estimator is a classifier.

decision_function(X, **decision_function_params)#

property feature_names_in_#: Names of features seen during first step fit method.

fit(X, y, **fit_params)#

Fits the wrapper to the training data.

Raises:: IncompatibleEstimatorsError – If any of the estimators passed as arguments does not support multi-output functionality, if the secondary estimators are not of the same type (e.g., regressor, classifier), or if only one of the primary estimators is pairwise.

fit_predict(X, y=None, **fit_params)#

property n_features_in_#: Number of features seen during fit.

predict(X, **predict_params)#

predict_log_proba(X, **predict_log_proba_params)#

predict_proba(X, **predict_proba_params)#

score(X, y=None)#

class bipartite_learn.wrappers.MultipartiteSamplerWrapper(samplers: BaseEstimator | Sequence[BaseEstimator], ndim: int | None = 2)#

Bases: BaseMultipartiteSampler

Manages a sampler for each feature space in multipartite datasets.

class bipartite_learn.wrappers.MultipartiteTransformerWrapper(transformers: BaseEstimator | Sequence[BaseEstimator], ndim: int | None = 2)#

Bases: BaseMultipartiteEstimator, TransformerMixin

Manages a transformer for each feature space in multipartite datasets.

fit(X, y=None)#

fit_transform(X, y=None)#

Fit to data, then transform it.

Fits transformer to X and y with optional parameters fit_params and returns a transformed version of X.

Parameters:

X (array-like of shape (n_samples, n_features)) – Input samples.
y (array-like of shape (n_samples,) or (n_samples, n_outputs), default=None) – Target values (None for unsupervised transformations).
**fit_params (dict) – Additional fit parameters.

Returns:

X_new – Transformed array.

Return type:

ndarray array of shape (n_samples, n_features_new)

transform(X, y=None)#
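The idea of managing one transformer per feature space can be illustrated with a small self-contained sketch. Both classes below are hypothetical: Standardizer stands in for any monopartite transformer, and PerAxisTransformer only mimics the behavior described for MultipartiteTransformerWrapper, without claiming to match its implementation.

```python
import numpy as np

class Standardizer:
    """Tiny stand-in for a monopartite transformer such as a scaler."""

    def fit(self, X, y=None):
        self.mean_ = X.mean(axis=0)
        self.scale_ = X.std(axis=0)
        self.scale_[self.scale_ == 0] = 1.0  # avoid dividing by zero
        return self

    def transform(self, X):
        return (X - self.mean_) / self.scale_

class PerAxisTransformer:
    """Hypothetical illustration: apply one transformer to each feature matrix."""

    def __init__(self, transformers):
        self.transformers = transformers

    def fit(self, X, y=None):
        # X is a list of feature matrices, one per axis of the dataset.
        for transformer, Xi in zip(self.transformers, X):
            transformer.fit(Xi)
        return self

    def transform(self, X):
        return [t.transform(Xi) for t, Xi in zip(self.transformers, X)]

    def fit_transform(self, X, y=None):
        return self.fit(X, y).transform(X)

X = [np.array([[1.0, 2.0], [3.0, 6.0]]), np.array([[10.0], [20.0], [30.0]])]
Xt = PerAxisTransformer([Standardizer(), Standardizer()]).fit_transform(X)
print([Xi.shape for Xi in Xt])
```

Each feature matrix keeps its own shape and is scaled with statistics computed from that matrix alone.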
