In camera matrix the focal lengths fx,fy are expressed in pixel units. To convert focals in World units Fx, Fy you need sensor size in same units using similar triangle

Diamond turningoptics

More in general, you need to calculate extrinsic parameter, rotation and translation matrix for the scene you are viewing. You can use some well know markers in the scene you are viewing. See real-world-camera-position-knowing-marker-geometry

Diamond turningTool

I am trying to solve the same problem but I do not know the camera sensor width (W) nor height (H). The camera I am using is the built-in back camera of the Samsung Galaxy A5 (2015). The only information I found are the focal length (3.69 mm), the focal length 35mm (28mm) and the image resolution is 2448x3264. Is there any way to transform the intrinsic matrix from pixel to world unit using these info? Extracting the ratios W/w and H/h by doing W/w = Fx/fx seems odd to me because I am assuming that Fx=Fy=3.69mm.

To transform from pixel to world you need to know the object distance and orientation too. N pixels can refer to small near object or large far object, rotation of camera axis changes resulting size of same object at same distance

I am trying to calibrate my camera (sony alpha 6000 20mm lens) using open CV. In the parameters list, I can find the focal length coefficient as 5192. How can I calculate the corresponding focal length from that ?