Cursor as a model-neutral test bed: frontier labs send their newest models to Cursor before release to tune them inside the same agent harness that every model shares.