Text this: MKER: multi-modal knowledge extraction and reasoning for future event prediction