torch_frame.datasets.Mercari

class Mercari(root: str, num_rows: Optional[int] = None, col_to_text_embedder_cfg: Optional[Union[dict[str, torch_frame.config.text_embedder.TextEmbedderConfig], TextEmbedderConfig]] = None)[source]

Bases: Dataset

The Mercari Price Suggestion Challenge dataset from Kaggle.

Parameters:

num_rows (int, optional) – Number of rows to subsample. (default: None)

STATS:

#rows

#cols (numerical)

#cols (categorical)

#cols (text_embedded)

Task

Missing value ratio

1,482,535

1

4

2

regression

0.0%