Video and Image Processing based on Kernel Representations