Efficient Machine Learning Methods for Document Image Analysis