An association rule based model for information extraction from protein sequence data