- Can we generate questions of various types? - [ ] Can we label the type of question using a language model? - [ ] Can we come up with prompts that get a LM to generate questions of each type? - For LLMs that offer logprobs, we can evaluate the logprob of question in context.