Dataset
Utils
utils.get_download_dir() | Get the absolute path to the download directory.
utils.download(url[, path, overwrite, …]) | Download a given URL.
utils.check_sha1(filename, sha1_hash) | Check whether the SHA-1 hash of the file content matches the expected hash.
utils.extract_archive(file, target_dir) | Extract archive file.
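To make the hash check concrete, here is a minimal sketch of what comparing a file's SHA-1 digest against an expected value can look like. It uses only Python's standard hashlib module and is not DGL's implementation; the function name sha1_matches and the chunk_size parameter are hypothetical names chosen for illustration.

```python
import hashlib


def sha1_matches(filename, sha1_hash, chunk_size=1 << 20):
    """Return True if the SHA-1 digest of the file content equals sha1_hash.

    Hypothetical helper for illustration; not the DGL utils.check_sha1 code.
    """
    sha1 = hashlib.sha1()
    with open(filename, "rb") as f:
        # Read in fixed-size chunks so a large download need not fit in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            sha1.update(chunk)
    # Hex digests are lowercase, so normalize the expected value before comparing.
    return sha1.hexdigest() == sha1_hash.lower()
```

A check like this is typically run right after a download, so that a truncated or corrupted file is detected before it is extracted.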
Dataset Classes
Stanford Sentiment Treebank dataset
For more information about the dataset, see Sentiment Analysis.
class dgl.data.SST(mode='train', vocab_file=None)
Stanford Sentiment Treebank dataset.
Each sample is the constituency tree of a sentence. The leaf nodes represent words. Each word is an int value stored in the x feature field. Non-leaf nodes have the special value PAD_WORD in the x field. Each node also has a sentiment annotation, one of 5 classes (very negative, negative, neutral, positive and very positive). The sentiment label is an int value stored in the y feature field.
Note
This dataset class is compatible with PyTorch's Dataset class.
Note
All samples are loaded and preprocessed in memory first.
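The per-node layout described above can be sketched with plain Python lists. This is only an illustration of the x and y fields, not code that loads the real dataset: the value chosen for PAD_WORD, the word ids, and the mapping of label integers 0 to 4 onto the five class names are all assumptions made for the example.

```python
# Assumed sentinel for non-leaf nodes; the real value is defined by the dataset class.
PAD_WORD = -1

# Assumed ordering of the five sentiment classes as integers 0..4.
LABELS = ["very negative", "negative", "neutral", "positive", "very positive"]

# Toy tree for a two-word sentence: node 0 is the root, nodes 1 and 2 are leaves.
x = [PAD_WORD, 17, 42]  # made-up word ids for leaves, PAD_WORD for the root
y = [1, 1, 3]           # one sentiment label per node, leaves and root alike


def leaf_ids(x):
    """Indices of nodes that carry a real word id rather than PAD_WORD."""
    return [i for i, word in enumerate(x) if word != PAD_WORD]


def label_names(y):
    """Map integer sentiment labels to readable class names."""
    return [LABELS[label] for label in y]
```

Because every node has a label, a model can be trained on phrase-level sentiment at the internal nodes as well as on the sentence-level label at the root.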
__getitem__(idx)
Get the tree with index idx.
Parameters: idx (int) – Tree index.
Returns: Tree.
Return type: dgl.DGLGraph
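Index access of this kind is what makes a class usable as a map-style dataset. The sketch below shows the protocol with a toy in-memory class; ToyTreeDataset is a hypothetical stand-in rather than the SST class, the strings stand in for graph objects, and the presence of __len__ is an assumption, since only __getitem__ is documented here.

```python
class ToyTreeDataset:
    """Hypothetical in-memory dataset that exposes index access."""

    def __init__(self, trees):
        # Everything is materialized up front, mirroring a dataset
        # that loads and preprocesses all samples in memory.
        self._trees = list(trees)

    def __len__(self):
        # Assumed here; map-style datasets usually report their size.
        return len(self._trees)

    def __getitem__(self, idx):
        # Return the sample stored at position idx.
        return self._trees[idx]


dataset = ToyTreeDataset(["tree-0", "tree-1", "tree-2"])
first_tree = dataset[0]
```

Any object with these two methods can be indexed and sized in the usual way, which is the interface that data-loading utilities for map-style datasets rely on.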