[Source code] Stochastic average gradient method for training conditional random fields

We apply stochastic average gradient (SAG) algorithms to the training of conditional random fields (CRFs).

We describe a practical implementation that uses structure in the CRF gradient to reduce the memory requirement of this linearly-convergent stochastic gradient method, propose a non-uniform sampling scheme that substantially improves practical performance, and analyze the rate of convergence of the SAGA variant under non-uniform sampling.
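To make the ingredients above concrete, the following is a minimal sketch of SAGA with non-uniform sampling. It is not the authors' CRF implementation: it assumes a toy least-squares objective, samples each example with probability proportional to its gradient Lipschitz constant, and uses a step size of 1/(3 * mean Lipschitz constant), all of which are illustrative choices. The function name `saga_nonuniform` and its parameters are hypothetical. Storing one scalar residual per example, rather than a full gradient vector, is only loosely analogous to the memory reduction described for CRFs.

```python
import numpy as np

def saga_nonuniform(X, y, n_iters=20000, seed=0):
    """SAGA sketch for f(w) = (1/n) * sum_i 0.5 * (x_i . w - y_i)^2.

    Assumptions (not from the source): least-squares loss, sampling
    probabilities p_i proportional to L_i = ||x_i||^2, fixed step size.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    L = np.sum(X * X, axis=1)        # per-example Lipschitz constants
    p = L / L.sum()                  # non-uniform sampling distribution
    step = 1.0 / (3.0 * L.mean())    # illustrative step size

    w = np.zeros(d)
    resid_mem = np.zeros(n)          # one stored scalar per example
    grad_avg = np.zeros(d)           # (1/n) * sum_i resid_mem[i] * X[i]

    for i in rng.choice(n, size=n_iters, p=p):
        r_new = X[i] @ w - y[i]
        delta = (r_new - resid_mem[i]) * X[i]   # new minus stored gradient
        # Dividing by n * p[i] keeps the search direction unbiased.
        w -= step * (delta / (n * p[i]) + grad_avg)
        grad_avg += delta / n                    # keep the average in sync
        resid_mem[i] = r_new
    return w

# Usage: rows with different scales give different Lipschitz constants.
rng = np.random.default_rng(1)
X = rng.normal(size=(50, 5)) * rng.uniform(0.5, 2.0, size=(50, 1))
y = X @ rng.normal(size=5) + 0.1 * rng.normal(size=50)
w_saga = saga_nonuniform(X, y)
w_exact = np.linalg.lstsq(X, y, rcond=None)[0]
```

The importance weight `1 / (n * p[i])` is what separates this from uniform SAGA: examples drawn more often have their correction term scaled down, so the expected update still equals the full gradient.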

Our experimental results reveal that our method often significantly outperforms existing methods in terms of the training objective, and performs as well as or better than optimally tuned stochastic gradient methods in terms of test error.

Conditional random fields (CRFs) are a ubiquitous tool in natural language processing.

They are used for part-of-speech tagging, semantic role labeling, topic modeling, information extraction, shallow parsing, and named-entity recognition, as well as a host of other applications in natural language processing and in other fields such as computer vision.

