Natural Language and Information Processing Research Group
2023
Word segmentation from transcriptions of child-directed speech using lexical and sub-lexical cues
Journal of Child Language. 2023
[pdf]
Automated hate speech detection and span extraction in underground hacking and extremist forums
Natural Language Engineering. 2023
[pdf]
On the application of Large Language Models for language teaching and assessment technology
Proceedings of Empowering Education with LLMs – the Next-Gen Interface and Content Generation. 2023
[pdf]
Argot as a Trust Signal: Slang, Jargon & Reputation on a Large Cybercrime Forum
Proceedings of the 22nd Annual Workshop on the Economics of Information Security. 2023
[pdf]
MultiGED-2023 shared task at NLP4CALL: Multilingual Grammatical Error Detection
Proceedings of the 12th Workshop on NLP for Computer Assisted Language Learning. 2023
[pdf]
Visual Spatial Reasoning
Transactions of the Association for Computational Linguistics (TACL). 2023
[pdf]
Functional Distributional Semantics at Scale
Proceedings of the 12th Joint Conference on Lexical and Computational Semantics (*SEM). 2023
[pdf]
SemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval)
Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval 2023)
[pdf]
On the Intersection of Context-Free and Regular Languages
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics. 2023
[pdf]
On the Usefulness of Embeddings, Clusters and Strings for Text Generation Evaluation
Proceedings of the International Conference on Learning Representations (ICLR). 2023
[pdf]
On the Effect of Anticipation on Reading Times
Transactions of the Association for Computational Linguistics. 2023
[pdf]
A survey on recent approaches to Question Difficulty Estimation from text
ACM Computing Surveys. 2023
[pdf]
Probabilistic Lexical Semantics: From Gaussian Embeddings to Bernoulli Fields
Probabilistic Approaches to Linguistic Theory. 2023
2022
Varifocal Question Generation for Fact-checking
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
[pdf]
The Architectural Bottleneck Principle
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
[pdf]
Opening up Minds with Argumentative Dialogues
Findings of the Association for Computational Linguistics: EMNLP 2022
[pdf]
CEPOC: The Cambridge Exams Publishing Open Cloze dataset
Proceedings of the 2022 International Conference on Language Resources and Evaluation (LREC 2022)
[pdf]
Prompting for a conversation: How to control a dialog model?
Proceedings of the 2nd Workshop on When Creative AI Meets Conversational AI 29th International Conference on Computational Linguistics. 2022
[pdf]
On the Role of Negative Precedent in Legal Outcome Prediction
Transations of the Association for Computational Linguistics. 2022
[pdf]
Benchmarking Compositionality with Formal Languages
Proceedings of the 29th International Conference on Computational Linguistics. 2022
[pdf]
Identifying relevant common sense information in knowledge graphs
Proceedings of the First Workshop on Commonsense Representation and Reasoning. 2022
[pdf]
Using machine learning to create a repository of judgments concerning a new practice area: a case study in animal protection law
Artificial Intelligence and Law. 2022
[pdf]
20 years of the Grammar Matrix: cross-linguistic hypothesis testing of increasingly complex interactions
Journal of Language Modelling. 2022
[pdf]
Using dependency parsing for few-shot learning in distributional semantics
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop. 2022
[pdf]
Extended Rater Representations in the Many-Facet Rasch Model
Journal of Applied Measurement. 2022
Accelerating Human Translation of Public Health Information into Low-Resource Languages with Machine Translation
Cambridge Occasional Papers in Linguistics. 2022
[pdf]
ALEN App: Argumentative Writing Support To Foster English Language Learning
Proceedings of the 17th Workshop on Innovative Use of {NLP} for Building Educational Applications. 2022
[pdf]
Towards an open-___domain chatbot for language practice
Proceedings of the 17th Workshop on Innovative Use of {NLP} for Building Educational Applications. 2022
[pdf]
The Specificity and Helpfulness of Peer-to-Peer Feedback in Higher Education
Proceedings of the 17th Workshop on Innovative Use of {NLP} for Building Educational Applications. 2022
[pdf]
POSTCOG: A tool for interdisciplinary research into underground forums at scale
Proceedings of WACCO. 2022
[pdf]
Probing for targeted syntactic knowledge through grammatical error detection
Proceedings of the 2022 SIGNLL Conference on Computational Natural Language Learning
Naturalistic Causal Probing for Morpho-Syntax
Transactions of the Association for Computational Linguistics. 2022
[pdf]
Rethinking Reinforcement Learning for Recommendation: A Prompt Perspective
SIGIR '22: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2022
[pdf]
Probing for the Usage of Grammatical Number
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2022
[pdf]
Analyzing Wrap-Up Effects through an Information-Theoretic Lens
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). 2022
[pdf]
On the probability-quality paradox in language generation
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). 2022
[pdf]
Constructing Open Cloze Tests Using Generation and Discrimination Capabilities of Transformers
Findings of the Association for Computational Linguistics: ACL 2022
Learning Functional Distributional Semantics with Visual Data
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics. 2022
[pdf]
2021
Non-Iterative Conditional Pairwise Estimation for the Rating Scale Model
Educational and Psychological Measurement. 2021
[pdf]
Word Complexity is in the Eye of the Beholder
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
[pdf]
Predicting Text Readability from Scrolling Interactions
Proceedings of the 25th Conference on Computational Natural Language Learning. 2021
[pdf]
Efficient Unsupervised NMT for Related Languages with Cross-Lingual Language Models and Fidelity Objectives
Proceedings of the Eighth Workshop on NLP for Similar Languages, Varieties and Dialects. 2021
[pdf]
A surprisal–duration trade-off across and within the world’s languages
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
[pdf]
Revisiting the Uniform Information Density Hypothesis
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
[pdf]
A Bayesian Framework for Information-Theoretic Probing
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
[pdf]
On Homophony and Rényi Entropy
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
[pdf]
Disambiguatory Signals are Stronger in Word-initial Positions
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume. 2021
[pdf]
Modeling the Unigram Distribution
Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021
[pdf]
A Non-Linear Structural Probe
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
[pdf]
What About the Precedent: An Information-Theoretic Analysis of Common Law
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
[pdf]
Finding Concept-specific Biases in Form–Meaning Associations
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
[pdf]
How (Non-)Optimal is the Lexicon?
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
[pdf]
Incremental Beam Manipulation for Natural Language Generation
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics (EACL). 2021
[pdf]
Synthetic Textual Features for the Large-Scale Detection of Basic-level Categories in English and Mandarin
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
[pdf]
Quantifying Social Biases in NLP: A Generalization and Empirical Comparison of Extrinsic Fairness Metrics
Transactions of the Association for Computational Linguistics (TACL). 2021
[pdf]
Computational linguistics and grammar engineering
Head-Driven Phrase Structure Grammar: The handbook. 2021
[pdf]
2020
Analyzing Neural Discourse Coherence Models
Proceedings of the First Workshop on Computational Approaches to Discourse. 2020
[pdf]
The Teacher-Student Chatroom Corpus
Proceedings of the 9th Workshop on NLP for Computer Assisted Language Learning (NLP4CALL). 2020
[pdf]
Morphologically Aware Word-Level Translation
Proceedings of the 2020 International Conference on Computational Linguistics (COLING)
[pdf]
A Graph Based Framework for Structured Prediction Tasks in Sanskrit
Computational Linguistics. 2020
[pdf]
Keep it Surprisingly Simple: A Simple First Order Graph Based Parsing Model for Joint Morphosyntactic Parsing in Sanskrit
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
[pdf]
Verbal Multiword Expressions for Identification of Metaphor
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020)
[pdf]
Seeing Both the Forest and the Trees: Multi-head Attention for Joint Classification on Different Compositional Levels
The 28th International Conference on Computational Linguistics (COLING-2020)
[pdf]
Grammatical error detection in transcriptions of spoken English
The 28th International Conference on Computational Linguistics (COLING-2020)
[pdf]
Coding Textual Inputs Boosts the Accuracy of Neural Networks
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
[pdf]
CanVEC - the Canberra Vietnamese-English code-switching natural speech corpus
Proceedings of The 12th Language Resources and Evaluation Conference. 2020
[pdf]
Investigating the effect of auxiliary objectives for the automated grading of learner English speech transcriptions
Proceedings of ACL. 2020
[pdf]
REPROLANG 2020: Automatic Proficiency Scoring of Czech, English, German, Italian, and Spanish learner essays
Proceedings of LREC. 2020
[pdf]
Investigating Cross-Linguistic Adjective Ordering Tendencies with a Latent-Variable Model
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
[pdf]
Multiple Question Fronting without Relational Constraints: An Analysis of Russian as a Basis for Cross-Linguistic Modeling
Proceedings of the 27th International Conference on Head-Driven Phrase Structure Grammar (HPSG). 2020
[pdf]
Linguists Who Use Probabilistic Models Love Them: Quantification in Functional Distributional Semantics
Proceedings of the Probability and Meaning Conference (PaM 2020)
[pdf]
Please Mind the Root: Decoding Arborescences for Dependency Parsing
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
[pdf]
Speakers Fill Lexical Semantic Gaps with Context
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
[pdf]
Pareto Probing: Trading Off Accuracy for Complexity
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
[pdf]
Information-Theoretic Probing for Linguistic Structure
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020
[pdf]
A Corpus for Large-Scale Phonetic Typology
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020
[pdf]
Predicting Declension Class from Form and Meaning
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020
[pdf]
A Tale of a Probe and a Parser
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020
[pdf]
Phonotactic Complexity and Its Trade-offs
Transactions of the Association for Computational Linguistics. 2020
[pdf]
Leveraging sentence similarity in natural language generation: Improving beam search using range voting
Proceedings of the 4th Workshop on Neural Generation and Translation (WNGT). 2020
[pdf]
What are the Goals of Distributional Semantics?
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020
[pdf]
Autoencoding Pixies: Amortised Variational Inference with Graph Convolutions for Functional Distributional Semantics
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020
[pdf]
2019
Meaning to Form: Measuring Systematicity as Information
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 2019
[pdf]
Counterfactual Data Augmentation for Mitigating Gender Stereotypes in Languages with Rich Morphology
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 2019
[pdf]
Active Learning for Financial Investment Reports
Proceedings of the Second Financial Narrative Processing Workshop (FNP 2019)
[pdf]
Don't Forget the Long Tail! A Comprehensive Analysis of Morphological Generalization in Bilingual Lexicon Induction
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019)
[pdf]
Multi-Task Learning for Coherence Modeling
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 2019
[pdf]
Entropy as a proxy for gap complexity in open cloze tests
Proceedings of the International Conference Recent Advances in Natural Language Processing (RANLP 2019)
[pdf]
The BEA-2019 Shared Task on Grammatical Error Correction
Proceedings of the 14th Workshop on Innovative Use of NLP for Building Educational Applications (BEA-2019)
[pdf]
Recursive Context-Aware Lexical Simplification
Proceedings of the EMNLP 2019
Complex Word Identification as a Sequence Labelling Task
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 2019
[pdf]
Comparative judgments are more consistent than binary classification for labelling word complexity
Proceedings of the 13th Linguistic Annotation Workshop. 2019
[pdf]
Words are Vectors, Dependencies are Matrices: Learning Word Embeddings from Dependency Graphs
Proceedings of the 13th International Conference on Computational Semantics (IWCS). 2019
[pdf]
The cross-linguistic performance of statistical word segmentation models
Journal of Child Language 46(6): 1169-1201. 2019
Overview of the 2019 Spoken CALL Shared Task
Proceedings of the 8th ISCA Workshop on Speech and Language Technology in Education (SLaTE). 2019
Skills Embeddings: a neural approach to multicomponent representations of students and tasks
Proceedings of the 12th International Conference on Educational Data Mining (EDM 2019)
Accurate modelling of language learning tasks and students using representations of grammatical proficiency
Proceedings of the 12th International Conference on Educational Data Mining (EDM 2019)
Automatic homework selection with deep behavioural cloning
Proceedings of the 20th International Conference on Artificial Intelligence in Education (AIED 2019)
Bad Form: Comparing Context-Based and Form-Based Few-Shot Learning in Distributional Semantic Models
Proceedings of the Second Workshop on Deep Learning for Low-Resource NLP (DeepLo 2019)
[pdf]
Modelling the interplay of metaphor and emotion through multitask learning
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019)
Semi-Supervised Bootstrapping of Dialogue State Trackers for Task-Oriented Modelling
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019)
Neural and FST-based approaches to grammatical error correction
Proceedings of the 14th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2019)
[pdf]
Context is Key: Grammatical Error Detection with Contextual Word Representations
Proceedings of the 14th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2019)
[pdf]
CAMsterdam at SemEval-2019 Task 6: Neural and graph-based featureextraction for the identification of offensive tweets
Proceedings of the International Workshop on Semantic Evaluation 2019 (SemEval 2019)
[pdf]
Factorising AMR generation through syntax
Proceedings of the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics. 2019
A Simple Joint Model for Improved Contextual Neural Lemmatization
Proceedings of the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics. 2019
On the Idiosyncrasies of the Mandarin Chinese Classifier System
Proceedings of the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics. 2019
A Probabilistic Generative Model of Linguistic Typology
Proceedings of the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics. 2019
Combining Disparate Sentiment Lexica with a Multi-View Variational Autoencoder
Proceedings of the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics. 2019
Gender Bias in Contextualized Word Embeddings
Proceedings of the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics. 2019
Contextualization of Morphological Inflection
Proceedings of the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics. 2019
A Simple and Robust Approach to Detecting Subject-Verb Agreement Errors
Proceedings of the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics. 2019
[pdf]
Abusive Language Detection with Graph Convolutional Networks
Proceedings of the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics. 2019
Broader context improves metaphor identification
Proceedings of the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics. 2019
Neural Grammatical Error Correction with Finite State Transducers
Proceedings of the 2019 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Automated Fact Checking in the News Room
Proceedings of The WebConf 2019 Conference Demonstrations
Strong Baselines for Complex Word Identification across Multiple Languages
Proceedings of the 2019 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Generating Token-Level Explanations for Natural Language Inference
Proceedings of the 2019 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Jointly Learning to Label Sentences and Tokens
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence (AAAI 2019)
[pdf]
2018
CAMB at CWI Shared Task 2018: Complex Word Identification with Ensemble-Based Voting
Proceedings of the 13th Workshop on Innovative Use of NLP for Building Educational Applications, pages 184-194, New Orleans, Louisiana, June 5, 2018
[pdf]
Functional Distributional Semantics: Learning Linguistically Informed Representations from a Precisely Annotated Corpus
PhD thesis, University of Cambridge. 2018
[pdf]
Emergent Communication Through Negotiation
International Conference on Learning Representations. 2018
Neural Automated Essay Scoring and Coherence Modeling for Adversarially Crafted Input
Proceedings of the 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics. 2018
[pdf]
Author Profiling for Abuse Detection
Proceedings of the 27th International Conference on Computational Linguistics. 2018
[pdf]
Developing an Automated Writing Placement System for ESL Learners
Journal of Applied Measurement in Education. 2018
[pdf]
Neural Character-based Composition Models for Abuse Detection
Proceedings of the EMNLP 2018 Workshop on Abusive Language Online
[pdf]
Advance Prediction of Ventricular Tachyarrhythmias using Patient Metadata and Multi-Task Networks
Proceedings of the NIPS Workshop on Machine Learning for Health (ML4H 2018)
[pdf]
Characterizing Eve: Analysing Cybercrime Actors in a Large Underground Forum
Proceedings of the 21st International Symposium on Research in Attacks, Intrusions and Defenses (RAID 2018)
You Still Talking to Me?' The Zero Auxiliary Progressive in Spoken British English, Twenty Years On
In: Vaclav Brezina, Robbie Love and Karin Aijmer (eds.), Corpus Approaches to Contemporary British Speech: Sociolinguistic Studies of the Spoken BNC2014. 2018
Overview of the 2018 Spoken CALL Shared Task
Proceedings of INTERSPEECH. 2018
Impact of ASR Performance on Free Speaking Language Assessment
Proceedings of INTERSPEECH. 2018
Aggressive language in an online hacking forum
Proceedings of the 2nd Abusive Language Workshop (ALW 2018)
How clever is the FiLM model, and how clever can it be?
Workshop on Shortcomings in Vision and Language (ECCV 2018)
[pdf]
Deep learning evaluation using deep linguistic processing
Workshop on Generalization in the Age of Deep Learning (NAACL 2018)
[pdf]
Neural sequence modelling for learner error prediction
The 13th Workshop on Innovative Use of NLP for Building Educational Applications (BEA-2018)
[pdf]
Sequence classification with human attention
Proceedings of the SIGNLL Conference on Computational Natural Language Learning (CoNLL 2018)
[pdf]
Scoring Lexical Entailment with a Supervised Directional Similarity Network
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018)
[pdf]
Towards automatically generating supply chain maps from natural language text
Proceedings of INCOM2018 (To Appear)
Language Model Based Grammatical Error Correction without Annotated Training Data
The 13th Workshop on Innovative Use of NLP for Building Educational Applications (BEA). 2018
[pdf]
Zero-shot Sequence Labeling through Transfer Learning
Proceedings of the 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL 2018)
[pdf]
Variable Typing: Assigning Meaning to Variables in Mathematical Text
Proceedings of the 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL 2018)
[pdf]
2017
Finding enthymemes in real-world texts: A feasibility study
Journal of Argument & Computation, pp. 1-17, doi: 10.3233/AAC-170020. 2017
Speaking, Seeing, Understanding: Correlating semantic models with conceptual representation in the brain
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP)
[pdf]
Semantic Composition via Probabilistic Model Theory
Proceedings of the 12th International Conference on Computational Semantics (IWCS). 2017
[pdf]
Variational Inference for Logical Inference
Proceedings of the 2017 Conference on Logic and Machine Learning in Natural Language (LaML)
[pdf]
Initializing neural networks for hierarchical multi-label text classification
BioNLP 2017
Cancer Hallmarks Analytics Tool (CHAT): A text mining approach to organise and evaluate scientific literature on cancer
Bioinformatics. 2017
[pdf]
Detecting Off-topic Responses to Visual Prompts
The 12th Workshop on Innovative Use of NLP for Building Educational Applications (BEA). 2017
[pdf]
An Error-Oriented Approach to Word Embedding Pre-Training
The 12th Workshop on Innovative Use of NLP for Building Educational Applications (BEA). 2017
[pdf]
Auxiliary Objectives for Neural Error Detection Models
The 12th Workshop on Innovative Use of NLP for Building Educational Applications (BEA). 2017
[pdf]
Artificial Error Generation with Machine Translation and Syntactic Patterns
The 12th Workshop on Innovative Use of NLP for Building Educational Applications (BEA). 2017
[pdf]
Neural Sequence-Labelling Models for Grammatical Error Correction
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP)
[pdf]
Grasping the finer point: A Supervised Similarity Network for Metaphor Detection
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP)
[pdf]
Semi-supervised Multitask Learning for Sequence Labeling
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL). 2017
[pdf]
Automatic Annotation and Evaluation of Error Types for Grammatical Error Correction
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL). 2017
[pdf]
The Representational Geometry of Word Embeddings Learned by Neural MT Systems
MT. 2017
Latent Variable Dialogue Models and their Diversity
Proceedings of the short papers of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2017)
[pdf]
Learning to Negate Adjectives with Bilinear Models
Proceedings of the short papers of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2017)
[pdf]
Modelling metaphor with attribute-based semantics
Proceedings of the short papers of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2017)
[pdf]
Modelling semantic acquisition in second language learning
BEA 2017
Multilingual Metaphor Processing: Experiments with Semi-supervised and Unsupervised Learning
Computational Linguistics. 2017
[pdf]
2016
The Goldilocks Principle: Reading Children's Books with Explicity Memory Representations
Proceedings of the International Conference on Learning Representations (ICLR). 2016
Automatic Extraction of Learner Errors in ESL Sentences Using Linguistically Enhanced Alignments
The 26th International Conference on Computational Linguistics (COLING-2016)
Robust Text Classification for Sparsely Labelled Data Using Multi-level Embeddings
The 26th International Conference on Computational Linguistics (COLING-2016)
[pdf]
A Proposition-Based Abstractive Summariser
The 26th International Conference on Computational Linguistics (COLING-2016)
[pdf]
Attending to characters in neural sequence labeling models
The 26th International Conference on Computational Linguistics (COLING-2016)
[pdf]
Recognising enthymemes in real-world texts: a feasibility study
Proceedings of the 6th COMMA Workshop on the Foundations of the Language of Argumentation. 2016
[pdf]
RELPRON: A Relative Clause Composition Data Set for Compositional Distributional Semantics
Computational Linguistics. 2016
[pdf]
Predicting the Direction of Derivation in English Conversion
Proceedings of the 14th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology. 2016
[pdf]
Take and Took, Gaggle and Goose, Book and Read: Evaluating the Utility of Vector Differences for Lexical Relation Learning
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL-16). 2016
[pdf]
SLEDDED: A Proposed Dataset of Event Descriptions for Evaluating Phrase Representations
Proceedings of The First Workshop on Evaluating Vector Space Representations for NLP (RepEval). 2016
[pdf]
Meta4meaning
Proceedings of The Seventh International Conference on Computational Creativity. 2016
[pdf]
The Categorial Framework for Compositional Distributional Semantics
Technical Report, University of Cambridge Computer Laboratory. 2016
[pdf]
Comparing Data Sources and Architectures for Deep Visual Representation Learning in Semantics
Empirical Methods in Natural Language Processing Conference. 2016
[pdf]
Solving the AL Chicken-and-Egg Corpus and Model Problem: Model-free Active Learning for Phenomena-driven Corpus Construction
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)
[pdf]
Issues in preprocessing current datasets for grammatical error correction
Technical report UCAM-CL-TR-895, Computer Laboratory, University of Cambridge. 2016
[pdf]
Artificial error generation for translation-based grammatical error correction
Ph.D. thesis, University of Cambridge. 2016
[pdf]
Predicting the impact of scientific concepts using full‐text features
Journal of the Association for Information Science and Technology. 2016
[pdf]
Meta4meaning: Automatic Metaphor Interpretation Using Corpus-Derived Word Associations
Proceedings 7th International Conference on Computational Creativity (ICCC 2016)
What Happens Next? Event Prediction Using a Compositional Neural Network Model
Proceedings 13th AAAI Conference on Artificial Intelligence (AAAI 2016)
[pdf]
Extracting Structured Scholarly Information from the Machine Translation Literature
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)
[pdf]
Candidate re-ranking for SMT-based grammatical error correction
Proceedings of the 11th Workshop on Innovative Use of NLP for Building Educational Applications. 2016
[pdf]
Constrained Multi-Task Learning for Automated Essay Scoring
Association for Computational Linguistics (ACL). 2016
[pdf]
Don’t Interrupt Me While I Type: Inferring Text Entered Through Gesture Typing on Android Keyboards
Proceedings of 16th Privacy Enhancing Technologies Symposium. 2016
[pdf]
Expected F-measure Training for Shift-Reduce Parsing with Recurrent Neural Networks
Proceedings of NAACL. 2016
[pdf]
Automatic semantic classification of scientific literature according to the hallmarks of cancer
Bioinformatics. 2016 Feb 1;32(3):432-40
[pdf]
Improving Argument Overlap for Proposition-Based Summarisation
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 479-485. 2016
[pdf]
Calling on the classical phone': a distributional model of adjective-noun errors in learners' English
COLING 2016
Vision and Feature Norms: Improving automatic feature norm learning through cross-modal maps
Proceedings of NAACL-HLT, 579-588, 2016
[pdf]
HyperLex: A Large-Scale Evaluation of Graded Lexical Entailment
arXiv preprint arXiv:1608.02117, 2016
[pdf]
Multi-modal representations for improved bilingual lexicon learning
The 54th Annual Meeting of the Association for Computational Linguistics, 188, 2016
[pdf]
Virtual Embodiment: A Scalable Long-Term Strategy for Artificial Intelligence Research
arXiv preprint arXiv:1610.07432, 2016
[pdf]
Metaphor: A Computational Perspective
Synthesis Lectures on Human Language Technologies. Edited by Graeme Hirst. Morgan & Claypool, USA. 2016
Functional Distributional Semantics
The 1st Workshop on Representation Learning for NLP (RepL4NLP-2016)
[pdf]
Resources for Building Applications with Dependency Minimal Recursion Semantics
The 10th International Conference on Language Resources and Evaluation. 2016
[pdf]
Unsupervised Modeling of Topical Relevance in L2 Learner Text
The 11th Workshop on Innovative Use of NLP for Building Educational Applications (BEA). 2016
[pdf]
A Joint Model for Word Embedding and Word Morphology
The 1st Workshop on Representation Learning for NLP (RepL4NLP-2016)
[pdf]
Compositional Sequence Labeling Models for Error Detection in Learner Writing
The 54th Annual Meeting of the Association for Computational Linguistics (ACL-2016)
[pdf]
Automatic Text Scoring Using Neural Networks
The 54th Annual Meeting of the Association for Computational Linguistics (ACL-2016)
[pdf]
Sentence Similarity Measures for Fine-Grained Estimation of Topical Relevance in Learner Essays
The 11th Workshop on Innovative Use of NLP for Building Educational Applications (BEA). 2016
[pdf]
2015
Learning Distributed Representations of Sentences from Unlabelled Data
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Learning to Understand Phrases by Embedding the Dictionary
Transactions of the Association for Computational Linguistics. 2015
Vector Space Models of Lexical Meaning
Handbook of Contemporary Semantic Theory — second edition, edited by Shalom Lappin and Chris Fox. 2015
The Frobenius anatomy of word meanings II: possessive relative pronouns
Journal of Logic and Computation. 2015
[pdf]
Computational Syntax
Syntax – Theory and Analysis. An International Handbook. Handbooks of Linguistics and Communication Science. 2015
CCG Supertagging with a Recurrent Neural Network
Proceedings of the Short Papers of the 53rd Annual Meeting of the Association for Computational Linguistics (ACL 2015)
[pdf]
Learning Adjective Meanings with a Tensor-Based Skip-Gram Model
SIGNLL Conference on Computational Natural Language Learning (CoNLL 2015)
[pdf]
Learning low-rank tensors for transitive verbs
Advances in Distributional Semantics Workshop. 2015
[pdf]
The Java Version of the C&C Parser: Version 0.95
Technical report, University of Cambridge Computer Laboratory, August. 2015
[pdf]
Low-Rank Tensors for Verbs in Compositional Distributional Semantics
Proceedings of the Short Papers of the 53rd Annual Meeting of the Association for Computational Linguistics (ACL 2015)
[pdf]
An Exploration of Discourse-Based Sentence Spaces for Compositional Distributional Semantics
Proceedings of the Workshop on Linking Models of Lexical, Sentential and Discourse-level Semantics (LSDSem). 2015
[pdf]
Layers of interpretation: On grammar and compositionality
Proceedings of the 11th International Conference on Computational Semantics. 2015
[pdf]
Towards a standard evaluation method for grammatical error detection and correction
Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics, Denver, CO, June. Association for Computational Linguistics
[pdf]
From distributional semantics to feature norms: grounding semantic models in human perceptual data
Proceedings of the Short Papers of the 11th International Conference on Computational Semantics (IWCS 2015), London, UK
[pdf]
Exploiting image generality for lexical entailment detection
Proceedings of the 53rd Annual Meeting of the Association for Computational ..., 2015
[pdf]
Multi-and cross-modal semantics beyond vision: Grounding in auditory perception
Proceedings of EMNLP, 2015
[pdf]
Unsupervised discovery of information structure in biomedical documents
Bioinformatics 31 (7), 1084-1092, 2015
[pdf]
Adaptive communication: Languages with more non-native speakers tend to have fewer word forms
PloS one 10 (6), e0128254, 2015
[pdf]
Visual bilingual lexicon induction with transferred convnet features
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing
[pdf]
Lacking Integrity: HPSG as a Morphosyntactic Theory
The 22nd International Conference on Head-Driven Phrase Structure Grammar (HPSG). 2015
[pdf]
Leveraging a Semantically Annotated Corpus to Disambiguate Prepositional Phrase Attachment
The 11th International Conference on Computational Semantics. 2015
[pdf]
Evaluating the performance of Automated Text Scoring systems
The 10th Workshop on Innovative Use of NLP for Building Educational Applications (BEA). 2015
[pdf]
Online Representation Learning in Recurrent Neural Language Models
The 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP)
[pdf]
2014
Simlex 999: Evaluating Semantic Models with Genuine Similarity Estimation
Computational Linguistics. 2014
A quantitative empirical analysis of the abstract/concrete distinction
Cognitive science. 2014
Evaluation of carcinogenic modes of action for pesticides in fruit on the Swedish market using a text-mining tool
Frontiers in pharmacology. 2014
[pdf]
Enter search terms
. 2014
Learning Abstract Concept Embeddings from Multi-Modal Data: Since You Probably Can't See What I Mean.
EMNLP. 2014
[pdf]
Text mining for improved human exposure assessment
Toxicology Letters. 2014
Verb clustering for brazilian portuguese
International Conference on Intelligent Text Processing and Computational Linguistics. 2014
[pdf]
Automatic Extraction of Property Norm‐Like Data From Large Text Corpora
Cognitive Science. 2014
[pdf]
A text-mining approach for chemical risk assessment and cancer research
Toxicology Letters. 2014
Multi-modal models for concrete and abstract concept meaning
Transactions of the Association for Computational Linguistics. 2014
[pdf]
Distributional Lexical Entailment by Topic Coherence
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL-14). 2014
[pdf]
Sentiment analysis of scientific citations
Technical Report, University of Cambridge, Computer Laboratory. 2014
[pdf]
Learning a Theory of Marriage (and other relations) from a Web Corpus
Proceedings of the Short Papers of the European Conference on Information Retrieval (ECIR 2014)
[pdf]
A Type-Driven Tensor-Based Semantics for CCG
EACL 2014 Type Theory and Natural Language Semantics Workshop (TTNLS)
[pdf]
Practical Linguistic Steganography using Contextual Synonym Substitution and a Novel Vertex Coding Method
Computational Linguistics. 2014
[pdf]
Application-Driven Relation Extraction with Limited Distant Supervision
COLING-14 Aha!-Workshop on Information Discovery in Text. 2014
[pdf]
Learning to Identify Historical Figures for Timeline Creation from Wikipedia Articles
HistoInformatics 2014 - the 2nd International Workshop on Computational History
[pdf]
A New Corpus and Imitation Learning Framework for Context-Dependent Semantic Parsing
Transactions of the Association for Computational Linguistics (TACL). 2014
[pdf]
Generating artificial errors for grammatical error correction.
Proceedings of the Student Research Workshop at the 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2014)
[pdf]
To err is human, to correct is divine
XRDS: Crossroads, The ACM Magazine for Students, vol. 21 num. 1. 2014
[pdf]
CRAB 2.0: A text mining tool for supporting literature review in chemical cancer risk assessment.
COLING (Demos). 2014
[pdf]
Probabilistic distributional semantics with latent variable models
Computational Linguistics. 2014
[pdf]
Improving Distributional Semantic Vectors through Context Selection and Normalisation
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL). 2014
[pdf]
Evaluation of Simple Distributional Compositional Operations on Longer Texts
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14). 2014
[pdf]
Using Sentence Plausibility to Learn the Semantics of Transitive Verbs
NIPS workshop on Learning Semantics, Montreal, Canada. 2014
[pdf]
Baseline Methods for Automated Fictional Ideation
Proceedings 5th International Conference on Computational Creativity (ICCC 2014)
[pdf]
A Graph-Based Approach to String Regeneration
Proceedings of the Student Research Workshop at the 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2014)
[pdf]
TagNText: A parallel corpus for the induction of resource-specific non-taxonomical relations from tagged images.
LREC. 2014
[pdf]
A Summariser based on Human Memory Limitations and Lexical Competition
Proceedings of EACL, 732-741. 2014
[pdf]
Reducing Dimensions of Tensors in Type-Driven Distributional Semantics
Proceedings of the Empirical Methods in Natural Language Processing Conference (EMNLP 2014), Doha, Qatar
[pdf]
Detecting Learner Errors in the Choice of Content Words Using Compositional Distributional Semantics.
COLING 2014
[pdf]
Improving Multi-Modal Representations Using Image Dispersion: Why Less is Sometimes More.
ACL (2), 835-841, 2014
[pdf]
Learning Image Embeddings using Convolutional Neural Networks for Improved Multi-Modal Semantics.
EMNLP, 36-45, 2014
[pdf]
ZIPF’S LAW ACROSS LANGUAGES OF THE WORLD: TOWARDS A QUANTITATIVE MEASURE OF LEXICAL DIVERSITY
Evolution of Language: Proceedings of the 10th International Conference ..., 2014
[pdf]
A systematic study of semantic vector space model parameters
Proceedings of the 2nd Workshop on Continuous Vector Space Models and their ..., 2014
[pdf]
Zipf's law and the grammar of languages: A quantitative study of Old and Modern English parallel texts
Corpus Linguistics and Linguistic Theory 10 (2), 175-211, 2014
[pdf]
Grammatical error correction using hybrid systems and type filtering
The Seventeenth Conference on Computational Natural Language Learning (CoNLL 2014): Shared Task
[pdf]
Using an Expert System to Automatically Map the Learning Profile of Individuals.
The Sixth International Conference on Mobile, Hybrid, and On-line Learning, eLmL. 2014
[pdf]
Looking for hyponyms in vector space
The Eighteenth Conference on Computational Natural Language Learning (CoNLL-14). 2014
[pdf]
2013
Automatic linguistic annotation of large scale L2 databases: The EF-Cambridge Open Language Database (EFCAMDAT)
Proceedings of the 31st Second Language Research Forum. Somerville, MA: Cascadilla Proceedings Project. 2013
[pdf]
The ef cambridge open language database (efcamdat) user manual part i: written production
. 2013
[pdf]
Minimally supervised learning for unconstrained conceptual property extraction
Proceedings of the 35th Annual Conference of the Cognitive Science Society. 2013
[pdf]
Active learning-based information structure analysis of full scientific articles and two applications for biomedical literature review
Bioinformatics. 2013
[pdf]
A tensor-based factorization model of semantic compositionality
Conference of the North American Chapter of the Association of Computational Linguistics (HTL-NAACL). 2013
[pdf]
Improved Information Structure Analysis of Scientific Documents Through Discourse and Lexical Constraints.
HLT-NAACL. 2013
[pdf]
Conceptual metaphor theory meets the data: a corpus-based human annotation study
Language resources and evaluation. 2013
[pdf]
UCAM-CORE: Incorporating structured distributional similarity into STS
Proceedings of *SEM 2013 Shared Task
[pdf]
Acquisition and Evaluation of Verb Subcategorization Resources for Biomedicine
Journal of Biomedical Informatics. 2013
[pdf]
Type-Driven Syntax and Semantics for Composing Meaning Vectors
Quantum Physics and Linguistics: A Compositional, Diagrammatic Discourse. 2013
[pdf]
Getting Creative with Semantic Similarity
Short Papers of the Seventh IEEE International Conference on Semantic Computing (ICSC-13). 2013
[pdf]
The Frobenius Anatomy of Relative Pronouns
13th Meeting on the Mathematics of Language (MoL 13). 2013
[pdf]
A quantum teleportation inspired algorithm produces sentence meaning from word meaning and grammatical structure
arXiv preprint arXiv:1305.0556. 2013
[pdf]
The Frobenius anatomy of word meanings I: subject and object relative pronouns
Journal of Logic and Computation. 2013
[pdf]
A computational model of logical metonymy
ACM Transactions on Speech and Language Processing (TSLP). 2013
[pdf]
Constrained grammatical error correction using Statistical Machine Translation
Proceedings of the Seventeenth Conference on Computational Natural Language Learning (CoNLL 2013): Shared Task
[pdf]
Reading tweeting minds: real-time analysis of short text for computational social science
Proceedings of the 24th ACM Conference on Hypertext and Social Media. 2013
[pdf]
SemEval-2013 task 4: Free paraphrases of noun compounds
Proceedings of the Seventh International Workshop on Semantic Evaluation (SemEval 2013)
[pdf]
Dependency language models for sentence completion
Proceedings of 2013 Conference on Empirical Methods in Natural Language Processing
[pdf]
Semantic Parsing as Machine Translation
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics. 2013
[pdf]
The Semi-generative Lexicon: Limits on Productivity
Advances in Generative Lexicon Theory, 455-474. 2013
Can distributional approaches improve on good old-fashioned lexical semantics?
IWCS Workshop Towards a Formal Distributional Semantics. 2013
[pdf]
Capturing Anomalies in the Choice of Content Words in Compositional Distributional Semantic Space.
RANLP 2013
[pdf]
Detecting Compositionality of Multi-Word Expressions using Nearest Neighbours in Vector Space Models.
EMNLP, 1427-1432, 2013
[pdf]
Concreteness and corpora: A theoretical and practical analysis
Proceedings of the Workshop on Cognitive Modeling and Computational ..., 2013
[pdf]
Classifying Intermediate Learner English: A Data-driven Approach to Learner Corpora
In S. Granger, G. Gilquin & F. Meunier (eds) Twenty Years of Learner Corpus Research: Looking back, Moving ahead. Corpora and Language in Use – Proceedings 1, Louvain-la-Neuve: Presses universitaires de Louvain. 2013
[pdf]
Developing and Testing a Self-Assessment and Tutoring System
The 9th Workshop on Innovative Use of NLP for Building Educational Applications (BEA). 2013
[pdf]
Parser lexicalisation through self-learning
The 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2013)
[pdf]
Minimally supervised dependency-based methods for natural language processing
PhD thesis, University of Cambridge. 2013
[pdf]
2012
PANACEA (Platform for Automatic, Normalised Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies)
Proceedings of the 16th Annual Conference of the European Association for Machine Translation: EAMT 2012; 2012 May 28-30; Trento, Italy. Trento: Fondazione Bruno Kessler; 2012. p. 90
Semi-supervised learning for automatic conceptual property extraction
Proceedings of the 3rd Workshop on Cognitive Modeling and Computational Linguistics. 2012
[pdf]
Exocrine pancreatic tumorigenesis and autotaxin expression
Toxicology Letters. 2012
A text mining approach for chemical cancer research and risk assessment
Toxicology Letters. 2012
Data and literature gathering in chemical cancer risk assessment
Integrated environmental assessment and management. 2012
CRAB Reader: A Tool for Analysis and Visualization of Argumentative Zones in Scientific Literature.
COLING (Demos). 2012
[pdf]
Multi-way Tensor Factorization for Unsupervised Lexical Acquisition
Proceedings of the 24th International Conference on Computational Linguistics (COLING-12). 2012
[pdf]
Merging Lexicons for Higher Precision Subcategorization Frame Acquisition
Proceedings of the LREC 2012 Workshop on Language Resource Merging
[pdf]
The Secret's in the Word Order: Text-to-Text Generation for Linguistic Steganography
COLING. 2012
[pdf]
Context-enhanced citation sentiment detection
Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
[pdf]
Detection of implicit citations for sentiment detection
Proceedings of the Workshop on Detecting Structure in Scholarly Discourse. 2012
[pdf]
Auralist: introducing serendipity into music recommendation
Proceedings of the fifth ACM international conference on Web search and data mining. 2012
[pdf]
Text mining for literature review and knowledge discovery in cancer risk assessment and research
PloS one. 2012
[pdf]
Talk of the city: Our tweets, our community happiness
Proceedings of the 6th International AAAI Conference on Weblogs and Social Media (ICWSM 2012)
[pdf]
Modelling selectional preferences in a lexical hierarchy
Proceedings of the First Joint Conference on Lexical and Computational Semantics-Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation. 2012
[pdf]
Learning syntactic verb frames using graphical models
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers-Volume 1. 2012
[pdf]
Talking places: Modelling and analysing linguistic content in foursquare
Privacy, Security, Risk and Trust (PASSAT), 2012 International Conference on and 2012 International Confernece on Social Computing (SocialCom)
[pdf]
Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora.
LREC. 2012
[pdf]
HOO 2012 Error Recognition and Correction Shared Task: Cambridge University Submission Report
BEA 2012
[pdf]
Automating Second Language Acquisition Research: Integrating Information Visualisation and Machine Learning
The EACL 2012 joint workshop of LINGVIS & UNCLH
[pdf]
Modeling Coherence in ESOL Learner Texts
The NAACL 2012 workshop on Innovative Use of Natural Language Processing for Building Educational Applications
[pdf]
2011
A comparison and user-based evaluation of models of textual information structure in the context of cancer risk assessment
BMC bioinformatics. 2011
[pdf]
A weakly-supervised approach to argumentative zoning of scientific documents
Proceedings of the Conference on Empirical Methods in Natural Language Processing. 2011
[pdf]
Latent vector weighting for word meaning in context
Proceedings of the Conference on Empirical Methods in Natural Language Processing. 2011
[pdf]
Hierarchical verb clustering using graph factorization
Proceedings of the Conference on Empirical Methods in Natural Language Processing. 2011
[pdf]
Weakly supervised learning of information structure of scientific abstracts—is it accurate enough to benefit real-world tasks in biomedicine?
Bioinformatics. 2011
[pdf]
Concrete sentence spaces for compositional distributional models of meaning
International Conference on Computational Semantics. 2011
[pdf]
Syntax-Based Grammaticality Improvement using CCG and Guided Search
Conference on Empirical Methods in Natural Language Processing. 2011
[pdf]
Syntactic processing using the generalized perceptron and beam search
Computational linguistics. 2011
[pdf]
Shift-reduce CCG parsing
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1. 2011
[pdf]
Robust argumentative zoning for sensemaking in scholarly documents
Advanced Language Technologies for Digital Libraries (ALTDL), Lecture Notes in Computer Science. 2011
[pdf]
Formalising and specifying underquantification
Proceedings of the Ninth International Conference on Computational Semantics. 2011
[pdf]
Towards an on-demand simple portuguese wikipedia
Proceedings of the Second Workshop on Speech and Language Processing for Assistive Technologies. 2011
[pdf]
Exciting and interesting: issues in the generation of binomials
Proceedings of the UCNLG+ Eval: Language Generation and Evaluation Workshop. 2011
[pdf]
Introduction to Linguistics for Natural Language Processing
Computer Laboratory, University of Cambridge. 2011
[pdf]
Identification of a Writer’s Native Language by Error Analysis.
MPhil dissertation, Computer Laboratory. 2011
[pdf]
A New Dataset and Method for Automatically Grading ESOL Texts
The 49th Annual Meeting of the Association for Computational Linguistics. 2011
[pdf]
Unsupervised Entailment Detection between Dependency Graph Fragments
The 2011 Workshop on Biomedical Natural Language Processing (BioNLP-11)
[pdf]
Intelligent Information Access from Scientific Papers
Current Challenges in Patent Information Retrieval, edited by Mihai Lupu, Katja Mayer, John Tait and Anthony J. Trippe. 2011
[pdf]
2010
Large-scale acquisition of feature-based conceptual representations from textual corpora
The Annual Meeting of the Cognitive Science Society. 2010
[pdf]
Methods for the automatic acquisition of Language Resources and their evaluation methods
. 2010
[pdf]
Using fMRI activation to conceptual stimuli to evaluate methods for extracting conceptual representations from corpora
Proceedings of the NAACL HLT 2010 First Workshop on Computational Neurolinguistics
[pdf]
Acquiring human-like feature-based conceptual representations from corpora
Proceedings of the NAACL HLT 2010 First Workshop on Computational Neurolinguistics
[pdf]
Identifying the information structure of scientific abstracts: an investigation of three different schemes
Proceedings of the 2010 Workshop on Biomedical Natural Language Processing
[pdf]
The acquisition of unrestricted feature-based conceptual representations from corpora
. 2010
Automatic lexical classification: bridging research and practice
Philosophical Transactions of the Royal Society of London A: Mathematical, Physical and Engineering Sciences. 2010
[pdf]
Investigating the cross-linguistic potential of verbnet: style classification
Proceedings of the 23rd International Conference on Computational Linguistics. 2010
[pdf]
Metaphor identification using verb and noun clustering
Proceedings of the 23rd International Conference on Computational Linguistics. 2010
[pdf]
Evaluation of Dependency Parsers on Unbounded Dependencies
Proceedings of the 23rd International Conference on Computational Linguistics (COLING-10). 2010
[pdf]
Mathematical foundations for a compositional distributional model of meaning
Linguistic Analysis. 2010
[pdf]
Supertagging for Efficient Wide-Coverage CCG Parsing
Supertagging: Using Complex Lexical Descriptions in Natural Language Processing. 2010
[pdf]
Statistical parsing
Handbook of Computational Linguistics and Natural Language Processing. 2010
[pdf]
Automated collage generation-with intent
Proceedings of the 1st International Conference on Computational Creativity. 2010
[pdf]
Linguistic steganography using automatically generated paraphrases
Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
[pdf]
Faster parsing by supertagger adaptation
Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics. 2010
[pdf]
Cambridge: Parser evaluation using textual entailment by grammatical relation comparison
Proceedings of the 5th International Workshop on Semantic Evaluation. 2010
[pdf]
Chart pruning for fast lexicalised-grammar parsing
Proceedings of the 23rd International Conference on Computational Linguistics: Posters. 2010
[pdf]
A fast decoder for joint word segmentation and POS-tagging using a single discriminative model
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
[pdf]
Practical linguistic steganography using contextual synonym substitution and vertex colour coding
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
[pdf]
Exploring variation across biomedical subdomains
In Proceedings of the 23rd International Conference on Computational Linguistics. 2010
Latent variable models of selectional preference
Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics. 2010
[pdf]
Semeval-2010 task 8: Multi-way classification of semantic relations between pairs of nominals
Proceedings of the 5th International Workshop on Semantic Evaluation. 2010
[pdf]
SemEval-2010 task 9: The interpretation of noun compounds using paraphrasing verbs and prepositions
Proceedings of the 5th International Workshop on Semantic Evaluation. 2010
[pdf]
Exploring variations across biomedical subdomains
Proceedings of the 23rd International Conference on Computational Linguistics. 2010
[pdf]
Scaling the iHMM: Parallelization versus Hadoop
Proceedings of the Workshop on Scalable Machine Learning and Applications, IEEE International Conference on Computing and Information Technology. 2010
[pdf]
Two strong baselines for the BioNLP 2009 event extraction task
Proceedings of the 2010 Workshop on Biomedical Natural Language Processing
[pdf]
Semi-supervised learning for biomedical information extraction
PhD thesis, University of Cambridge. 2010
[pdf]
Underquantification: an application to mass terms
Proceedings of Empirical, Theoretical and Computational Approaches to Countability in Natural Language, Bochum, Germany. 2010
[pdf]
Camtology: intelligent information access for science
Proceedings of the NAACL HLT 2010 Demonstration Session
[pdf]
Active learning for constrained Dirichlet process mixture models
Proceedings of the 2010 workshop on geometrical models of natural language semantics
[pdf]
Automated assessment of ESOL free text examinations
Computer Laboratory, University of Cambridge. 2010
[pdf]
Combining Manual Rules and Supervised Learning for Hedge Cue and Scope Detection
The 14th Conference on Natural Language Learning (CoNLL-10). 2010
[pdf]
2009
Automatic Lexical Classification--Balancing between Machine Learning and Linguistics.
PACLIC. 2009
[pdf]
Number sense disambiguation
Proceedings of the Conference of Pacific Association for Computational Linguistics (PACLING’09). 2009
[pdf]
Improved cancer risk assessment using text mining
Cancer Research. 2009
VerbNet overview, extensions, mappings and applications
Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Tutorial Abstracts
User-driven development of text mining resources for cancer risk assessment
Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing. 2009
[pdf]
Improving verb clustering with automatically acquired selectional preferences
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2-Volume 2
[pdf]
The first step in the development of text mining technology for cancer risk assessment: Identifying and organizing scientific evidence in risk assessment literature
BMC bioinformatics. 2009
[pdf]
Towards unrestricted, large-scale acquisition of feature-based conceptual representations from corpus data
Research on Language and Computation. 2009
[pdf]
EBMT for SMT: a new EBMT-SMT hybrid
Proceedings of the 3rd International Workshop on Example-Based Machine Translation. 2009
[pdf]
Comparing the accuracy of CCG and Penn Treebank parsers
Proceedings of the ACL-IJCNLP 2009 Conference Short Papers
[pdf]
Unbounded dependency recovery for parser evaluation
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2-Volume 2
[pdf]
Transition-based parsing of the Chinese treebank using a global discriminative model
Proceedings of the 11th International Conference on Parsing Technologies. 2009
[pdf]
Porting a lexicalized-grammar parser to the biomedical ___domain
Journal of biomedical informatics. 2009
[pdf]
An annotation scheme for citation function
Proceedings of the 7th SIGdial Workshop on Discourse and Dialogue. 2009
[pdf]
Towards discipline-independent argumentative zoning: evidence from chemistry and computational linguistics
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3-Volume 3
[pdf]
Semantic classification with WordNet kernels
Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers
[pdf]
Unsupervised and Constrained Dirichlet Process Mixture Models for Verb Clustering
Proceedings of the ACL Workshop on Geometrical Models of Natural Language Semantics. 2009
[pdf]
The infinite HMM for unsupervised PoS tagging
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing
[pdf]
Slacker semantics: why superficiality, dependency and avoidance of commitment can be the right way to go
Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics. 2009
[pdf]
Using lexical and relational similarity to classify semantic relations
Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics. 2009
[pdf]
Investigating content selection for language generation using machine learning
Proceedings of the 12th European Workshop on Natural Language Generation. 2009
[pdf]
Annotating genericity: How do humans decide? (A case study in ontology extraction)
Studies in Generative Grammar. 2009
[pdf]
What can formal or computational models tell us about how (much) language shaped the brain
Biological Foundations and the Origin of Syntax. 2009
[pdf]
Biomedical event extraction without training data
Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing: Shared Task. 2009
[pdf]
2008
Automatic classification of English verbs using rich syntactic features
. 2008
A new challenge for text mining: Cancer risk assessment
Proceedings of the ISMB BioLINK Special Interest Group on Text Data Mining. 2008
[pdf]
Verb class discovery from rich syntactic data
International Conference on Intelligent Text Processing and Computational Linguistics. 2008
[pdf]
Lexschem: A large subcategorization lexicon for french verbs
Language Resource and Evaluation conference. 2008
[pdf]
The choice of features for classification of verbs in biomedical texts
Proceedings of the 22nd International Conference on Computational Linguistics-Volume 1. 2008
[pdf]
Constructing a parser evaluation scheme
Coling 2008: Proceedings of the workshop on Cross-Framework and Cross-Domain Parser Evaluation
[pdf]
Asknet: Creating and evaluating large scale integrated semantic networks
International Journal of Semantic Computing. 2008
[pdf]
A tale of two parsers: investigating and combining graph-based and transition-based dependency parsing using beam-search
Proceedings of the Conference on Empirical Methods in Natural Language Processing. 2008
[pdf]
Adapting a lexicalized-grammar parser to contrasting domains
Proceedings of the Conference on Empirical Methods in Natural Language Processing. 2008
[pdf]
Using terms from citations for IR: some first results
European Conference on Information Retrieval. 2008
[pdf]
Sentence-based emotion classification for text-to-speech
International Workshop on Computational Aspects of Affectual and Emotional Interaction. 2008
Comparing citation contexts for information retrieval
Proceedings of the 17th ACM conference on Information and knowledge management. 2008
[pdf]
Learning compound noun semantics
University of Cambridge, Cambridge, UK. 2008
A stopping criterion for active learning
Computer, Speech and Language, Volume 22, Issue 3, July 2008
[pdf]
Dirichlet Process Mixture Models for Verb Clustering
Proceedings of the ICML workshop on Prior Knowledge for Text and Language. 2008
[pdf]
Pyridines, pyridine and pyridine rings: disambiguating chemical name entities
in Proceedings of BERBMTM-08 at LREC-2008
Generating research websites using summarisation techniques
Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Demo Session. 2008
[pdf]
Semantic classification with distributional kernels
Proceedings of the 22nd International Conference on Computational Linguistics-Volume 1. 2008
[pdf]
Cascaded classifiers for confidence-based chemical named entity recognition
BMC bioinformatics. 2008
[pdf]
Linguistic Adaptations for Resolving Ambiguity
The Evolution of Language: Proceedings of the 7th International Conference (EVOLANG7), Barcelona, Spain, 12-15 March 2008
[pdf]
Bootstrapping an interactive information extraction system for FlyBase curation.
Ontologies and Text Mining for Life Sciences. 2008
[pdf]
Statistical anaphora resolution in biomedical texts
Proceedings of the 22nd International Conference on Computational Linguistics-Volume 1. 2008
[pdf]
2007
I will shoot your shopping down and you can shoot all my tins: automatic lexical acquisition from the CHILDES database
Proceedings of the Workshop on Cognitive Aspects of Computational Language Acquisition. 2007
[pdf]
Combining Symbolic and Distributional Models of Meaning
AAAI Spring Symposium: Quantum Interaction. 2007
[pdf]
Chinese segmentation with a word-based perceptron algorithm
Annual Meeting-Association for Computational Linguistics. 2007
[pdf]
Formalism-independent parser evaluation with CCG and DepBank
Annual Meeting-Association for Computational Linguistics. 2007
[pdf]
Improving the efficiency of a wide-coverage CCG parser
Proceedings of the 10th International Conference on Parsing Technologies. 2007
[pdf]
Linguistically motivated large-scale NLP with C&C and Boxer
Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions. 2007
[pdf]
Perceptron training for a wide-coverage lexicalized-grammar parser
Proceedings of the Workshop on Deep Linguistic Processing. 2007
[pdf]
Asknet: Automated semantic knowledge network
PROCEEDINGS OF THE NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE. 2007
[pdf]
Wide-coverage efficient statistical parsing with CCG and log-linear models
Computational Linguistics. 2007
[pdf]
Whose Idea Was This, and Why Does it Matter? Attributing Scientific Work to Citations.
HLT-NAACL. 2007
[pdf]
An overview of evaluation methods in TREC ad hoc information retrieval and TREC question answering
In Evaluation of Text and Speech Systems. L. Dybkjaer, H. Hemsen, W. Minker (Eds.) Springer, Dordrecht (The Netherlands). 2007
[pdf]
Annotation of chemical named entities
Proceedings of the Workshop on BioNLP 2007: Biological, Translational, and Clinical Language Processing
[pdf]
Designing and evaluating a semantic annotation scheme for compound nouns
Proc. Corpus Linguistics. 2007
[pdf]
Annotating and learning compound noun semantics
Proceedings of the 45th Annual Meeting of the ACL: Student Research Workshop. 2007
[pdf]
Evaluating and combining biomedical named entity recognition systems
Proceedings of the 2007 Workshop on Biological, translational, and clinical language processing
[pdf]
Tackling the BioCreative2 Gene Mention task with Conditional Random Fields and syntactic parsing
Proceedings of the Second BioCreative Challenge Evaluation Workshop. 2007
[pdf]
From gene names to actual genes
Proceedings of BioLINK SIG: Linking Literature, Information and Knowledge for Biology . 2007
[pdf]
Evaluating an Open Domain GRE algorithm on closed domains System IDs: CAM-B, CAM-T, CAM-BU and CAM-TU
Proceedings of the Workshop on Using Corpora for NLG: Language Generation and Machine Translation (UCNLG+ MT). 2007
[pdf]
Semantic composition with (robust) minimal recursion semantics
Proceedings of the Workshop on Deep Linguistic Processing. 2007
[pdf]
Co-occurrence contexts for noun compound interpretation
proceedings of the Workshop on A Broader Perspective on Multiword Expressions. 2007
[pdf]
Applying robust semantics
Proceedings of the 10th Conference of the Pacific Assocation for Computational Linguistics (PACLING). 2007
[pdf]
Integrating general-purpose and ___domain-specific components in the analysis of scientific text
Proc. of the UK e-Science Programme All Hands Meeting. 2007
[pdf]
Adapting the RASP system for the CoNLL07 ___domain-adaptation task
Proceedings of the CoNLL Shared Task Session of EMNLP-CoNLL. 2007
[pdf]
Weakly supervised learning for hedge classification in scientific literature
Annual Meeting of the Association for Computational Linguistics. 2007
[pdf]
A system for large-scale acquisition of verbal, nominal and adjectival subcategorization frames from corpora
Annual Meeting of the Association for Computational Linguistics. 2007
[pdf]
Semi-supervised training of a statistical parser from unlabeled partially-bracketed data
Proceedings of the 10th International Conference on Parsing Technologies. 2007
[pdf]
2006
A large-scale extension of VerbNet with novel verb classes
Atti del XII Congresso Internazionale di Lessicografia: Torino, 6-9 settembre 2006
[pdf]
Zone analysis in biology articles as a basis for information extraction
International journal of medical informatics. 2006
[pdf]
Automatic classification of verbs in biomedical texts
Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics. 2006
[pdf]
Extensive classifications of english verbs
Proceedings of the 12th EURALEX International Congress. 2006
Partial training for a lexicalized-grammar parser
Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics. 2006
[pdf]
Multi-tagging for lexicalized-grammar parsing
Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics. 2006
[pdf]
Argumentative zoning applied to critiquing novices’ scientific abstracts
In ``Computing Attitude and Affect in Text: Theory and Applications'' James G. Shanahan, Yan Qu, Janyce Wiebe (Eds.) Springer, Dordrecht, The Netherlands, 2005. 2006
[pdf]
Argumentative zoning for improved citation indexing
In ``Computing Attitude and Affect in Text: Theory and Applications'' James G. Shanahan, Yan Qu, Janyce Wiebe (Eds.) Springer, Dordrecht, The Netherlands, 2005.. 2006
[pdf]
Creating a test collection for citation-based IR experiments
Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics. 2006
[pdf]
A bootstrapping approach to unsupervised detection of cue phrase variants
Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics. 2006
[pdf]
Automatic classification of citation function
Proceedings of the 2006 conference on empirical methods in natural language processing
[pdf]
How to find better index terms through citations
Proceedings of the Workshop on How Can Computational Linguistics Improve Information Retrieval?. 2006
[pdf]
Bootstrapping and Evaluating Named Entity Recognition in the Biomedical Domain
Proceedings of the HLT-NAACL BioNLP Workshop on Linking Natural Language and Biology. 2006
[pdf]
An architecture for language processing for scientific texts
Proceedings of the UK e-Science All Hands Meeting 2006
[pdf]
Flexible interfaces in the application of language technology to an escience corpus
Proceedings of the UK e-Science programme all hands meeting. 2006
[pdf]
Preprocessing and tokenisation standards in DELPH-IN tools
Proceedings of the 5th International Conference on Language Resources and Evaluation. 2006
[pdf]
A standoff annotation interface between DELPH-IN components
Proceedings of the 5th Workshop on NLP and XML: Multi-Dimensional Markup in Natural Language Processing. 2006
[pdf]
Conventional speech act formulae: from corpus findings to formalization
Proceedings of Constraints in Discourse, NUI Maynooth, Ireland. 2006
[pdf]
Conventional speech act formulae in HPSG
poster). 13th International Conference on Head-Driven Phrase Structure Grammar, Varna. 2006
[pdf]
Acquiring ontological relationships from wikipedia using rmrs
Proceedings of Workshop on Web content Mining with Human Language Technologies, ISWC06. 2006
[pdf]
Bootstrapping the recognition and anaphoric linking of named entities in drosophila articles
Proc. of the Pacific Symposium on Biocomputing. 2006
[pdf]
An introduction to tag sequence grammars and the RASP system parser
Computer Laboratory Technical Report. 2006
[pdf]
Annotation guidelines for Named Entity Recognition in the FlySLIP project
University of Cambridge, CRL, Cambridge. 2006
[pdf]
A large subcategorization lexicon for natural language processing applications
Proc. of the 5th LREC. 2006
[pdf]
The second release of the RASP system
Proceedings of the COLING/ACL on Interactive presentation sessions. 2006
[pdf]
Evaluating the accuracy of an unlexicalized statistical parser on the PARC DepBank
Proceedings of the COLING/ACL on Main conference poster sessions. 2006
[pdf]