The official code of "SCROLLS: Standardized CompaRison Over Long Language Sequences".

Last update: Dec 23, 2022

Overview

SCROLLS

This repository contains the official code of the paper: "SCROLLS: Standardized CompaRison Over Long Language Sequences".

Citation

@misc{shaham2022scrolls,
      title={SCROLLS: Standardized CompaRison Over Long Language Sequences}, 
      author={Uri Shaham and Elad Segal and Maor Ivgi and Avia Efrat and Ori Yoran and Adi Haviv and Ankit Gupta and Wenhan Xiong and Mor Geva and Jonathan Berant and Omer Levy},
      year={2022},
      eprint={2201.03533},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

Loading the SCROLLS Benchmark Datasets

via 🤗 Datasets (huggingface/datasets) library (recommended):

Installation

Usage:

from datasets import load_dataset

qasper_dataset = load_dataset("tau/scrolls", "qasper")
"""
Options are: ["gov_report", "summ_screen_fd", "qmsum", "narrative_qa", "qasper", "quality", "contract_nli"]
"""

via ZIP files, where each split is in a JSONL file:
- GovReport
- SummScreenFD
- QMSum
- NarrativeQA
- Qasper
- QuALITY
- ContractNLI

The official code of "SCROLLS: Standardized CompaRison Over Long Language Sequences".

Related tags

Overview

SCROLLS

Links

Citation

Loading the SCROLLS Benchmark Datasets

Owner

TAU NLP Group

[WWW 2022] Zero-Shot Stance Detection via Contrastive Learning

Simulating an AI playing 2048 using the Expectimax algorithm

BackgroundRemover lets you Remove Background from images and video with a simple command line interface

Dynamic View Synthesis from Dynamic Monocular Video

Co-GAIL: Learning Diverse Strategies for Human-Robot Collaboration

This code is for eCaReNet: explainable Cancer Relapse Prediction Network.

A Kaggle competition: discriminate gender based on handwriting

This is RFA-Toolbox, a simple and easy-to-use library that allows you to optimize your neural network architectures using receptive field analysis (RFA) and create graph visualizations of your architecture.

Instance Segmentation in 3D Scenes using Semantic Superpoint Tree Networks

A simple, fully convolutional model for real-time instance segmentation.

This is the code repository implementing the paper "TreePartNet: Neural Decomposition of Point Clouds for 3D Tree Reconstruction".

A Python type explainer!

The description of FMFCC-A (audio track of FMFCC) dataset and Challenge resluts.

An intuitive library to extract features from time series

A large-scale video dataset for the training and evaluation of 3D human pose estimation models

Spectral Temporal Graph Neural Network (StemGNN in short) for Multivariate Time-series Forecasting

In this project, we'll be making our own screen recorder in Python using some libraries.

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

Mercer Gaussian Process (MGP) and Fourier Gaussian Process (FGP) Regression

HMLET (Hybrid-Method-of-Linear-and-non-linEar-collaborative-filTering-method)