in my opinion, the perspective problem comes from the fact that the houses are drawn from a viewpoint that is set very high (above first floor level) looking down, whereas the temple viewpoint is about in the middle of the door, looking up.
So that would be a problem that is difficult to fix with a 2D-Program. However, if the temple is made a LOT bigger, even more than Ali did (with the door about as high as the houses), then it will
a) have a more "correct" feel and
b) the temple will make players cower in fear...

(maybe you could do one of these upward camera sweeps

)
But otherwise the stle looks very professional, and I love the details