Probably AGS can't recognise what you want, to end cutscene or to pass the current speech line (because you set mouse or key-- no auto remove). Anyway, i think you should set mouse only and to increase speech's text speed (game.text_speed variable) for to be seemed like the player must click for removing the speech.
I hope it helped a bit
I hope it helped a bit