Here is another point..
1) Shutter. Maybe it is not FPS only but Shutter makes the difference
With 24p you shoot with 1\48 ... Stop frame at 1\48 is a blur.. here or there or everywhere.. so constant blur from frame to frame gives smooth action.
With 48p you shoot at 1\96 at least.. Stop frame at 1\96 is likely to be sharp.. at least half will be sharp.. Depends on lens of course but chances for shart "photo-like" frame are much higher.
And that is where all those "sets look like sets" come.
2) DOF. Another trap comes from DOF. For 3d they need focus everywhere, so that eye can then see 3d and blur himself..
And that is where another trap "sets look fake" come. In 2d they would be blured away , no one would look there.. But now it is all sharp.. oops
Just my 5 cents