foundation models as parsers