In this blog, we explore single-image photogrammetry – which is the extraction of information from a single image – including measurements and creation of 3D models. As we delve into this topic, we are learning a lot about the practical nuances of applying these techniques to solve real-world problems. We are currently working on calculating the real-life size of a Ganges river dolphin from the drone footage we acquired from our fieldwork in Bihar earlier this year. This blog continues from our previous post on measuring object size using a nadir image. While that post covered theoretical formulas, here we focus on the practical application of those formulas.
To get the real-life size of our dolphin, we would need to know the number of pixels the dolphin is made up of in an extracted frame (the image), along with the size of each pixel in some real-life units (eg: metres, centimetres). We would then multiply the number of pixels with the pixel real-life size to get estimates on area covered, breadth, and length. This real-life size that each pixel corresponds to is known as the Ground Sampling Distance or the GSD. The GSD is an extremely useful metric for any georeferenced image since it represents the distance between two consecutive pixel centres measured on the ground, or the distance a side of a pixel represents.
This exploration is driven by the absence of a reference object of a known size in any of our drone footage acquired during our fieldwork from earlier this year. It's challenging to have a reference object consistently in the frame since the drone moves with the dolphin sightings, which are sporadic and spaced out. If there had been a reference object, we could have used the ratio between the object's pixel count and its known size to establish a scale. This scale would then be used to determine the real-life size of a dolphin based on its pixel count.
Theoretically, measuring the GSD of a nadir image is straightforward. One of the formulas we can use to figure out the GSD is as follows, taken from our aforementioned blog:
GSD = A_L/ I_L
where
A_L is the real-life length of the area being captured in an image, in metres
I_L is the number of pixels that make up the length of that image
To calculate A_L we can use the following formula:
A_L= 2 * H * tan(FOV_L/ 2)
where
H is the altitude of the drone, in metres
FOV_L is the angular field of view along the length axis of the image
I_L and H are parameters that are easy to calculate. While I_L can be figured out by just examining the number of pixels making up the length of the image, H is a parameter captured by the drone itself as metadata. Finding the FOV_L for our drones though – took quite some time!
This parameter, more generally called the field of view (FOV), is used to calculate the GSD of our drone images using the above formula. We tried to figure out the FOV of our drone cameras by exploring forums and official drone manuals, which inevitably led us to some extremely interesting finds—especially regarding the factors and camera settings that might affect a drone's FOV.
So let’s take a look at what exactly is the FOV, how can we find the FOV for a given drone camera, and what all settings would one have to take into account if they are measuring the FOV on their own.
WHAT IS THE FIELD OF VIEW?
The term field of view refers to the viewable area across a specific axis as seen in an image. It is the amount of area that a particular lens system can capture in an image. The larger the FOV, the more area that can be seen and captured by the camera. Inversely, the smaller the FOV, the less is the area that is being seen. Thus, FOV is directly proportional to the extent or amount of area being captured by a camera.
This term usually refers to either the actual physical area that is captured (in units such as mm) or the angle at which a lens is capturing that area. In this context, we will be using FOV to refer to the angular extent of the captured image. Thus, it should be assumed to be expressed in degrees for the rest of this blogpost.
For any given image, there are multiple types of FOVs. These FOVs are specific to the axis being considered while measuring the area seen by a lens. By axis here, we are referring to an imaginary line that travels across the diagonal, width, or length of an image. Corresponding to these dimensions, there are three distinct FOVs: FOV_D, FOV_W, and FOV_L, respectively. We will be referring to these collectively as FOVs/ FOV unless specified otherwise.
To gain a deeper intuition about this term we can carry out a simple exercise.
Pull out your mobile phone and open your camera to photo mode. Try to keep your phone fixed in a particular position and observe how the edges of the area being captured in your screen change when you do the following:
Change aspect ratio of picture
Change from photo to video mode
Change from HD to 4K or to some other photo/video quality setting
FACTORS THAT AFFECT A DRONE’S FIELD OF VIEW
One’s first instinct in figuring out the FOV of their drone would be to simply check the official manual for the mentioned specs. Since we use off-the-shelf drones for our work, understanding any limitations was also crucial. As it turns out, the specs in the official DJI manuals aren’t specific or exhaustive. The official DJI manuals don’t really mention how the FOV changes with different camera settings but rather usually mention a single default FOV. This FOV measurement indicates the diagonal FOV and applies only to the native aspect ratio of that drone’s camera while taking pictures with that drone.
Consider this scenario. Say you need the FOV for when you are recording a video at 60 fps with a 16:9 aspect ratio at resolution 3840 x 2160. The first thought would be to apply the manual mentioned FOV and proceed. But that would be incorrect as based on the settings you are applying, the FOV also tends to change. Chances are that you won’t be able to simply search online for the FOV for your particular combination of settings or even find it for these specific settings in the manual. Rather, you will have to try and figure it out on your own.
The discussion around how to build an experiment to measure the FOV of a drone at some given settings is best left for another day. For now, let’s take a look at some of the factors that change the FOV of the drone’s camera.
Aspect Ratio
The aspect ratio is the ratio of width of the image to its height. It is typically expressed as two numbers separated by a colon, such as 4:3 or 16:9. Camera sensors, devices inside the camera that capture light to create an image, are manufactured in various shapes and sizes, each having a native aspect ratio. The native aspect ratio is determined by the number of pixels along the width and height of the sensor. When shooting in the sensor's native aspect ratio, the entire sensor area is used, maximising the resolution and image quality.
When we shift from one aspect ratio to another, we will usually notice a change in the extent of the area being captured in the frame. This occurs because the camera either crops the image or resizes it to fit the desired aspect ratio. Cropping involves trimming parts of the image, effectively reducing the number of pixels used from the sensor. For instance, changing from a 3:2 to a 1:1 (square) aspect ratio would mean cutting off parts of the image on the sides. Since the extent of area being captured is changing, the FOVs end up changing too.
Consider these two images captured by the same drone camera with different aspect ratios:
Zoom Level
This is one of the more obvious factors. The more you zoom in when taking an image, the lesser is the physical area you are capturing in the image. The more zoomed out the frame is, the more is the extent of the area being captured by the camera. As the captured area changes, so does the FOV.
Video Mode
If there are additional modes in which you can record your video, then FOVs for different modes might vary significantly. Even when the resolution and aspect ratio remain the same, a change in the recording mode can alter the FOVs. Take the DJI Mavic 2 Pro as an example. There are two different modes for recording a 4K video: Full FOV mode and the HQ mode. Videos recorded using these modes have the same resolution, 3840 x 2160. However, both of these have different FOVs!
In the Full FOV mode, the full camera sensor is used, more or less, followed by a downsampling of the video to a 4K resolution. In the HQ mode, however, a cropped portion of the sensor is used to capture video in 4K resolution² directly. This leads to different portions of the camera sensors being used and therefore, different amounts of areas being captured. Thus, their FOVs are different too.
Frames per second
Commonly written as fps, it is the measurement of how many individual image frames appear in one second of video. The higher the fps, the more frames there are in a video, and the smoother that video appears.
This was surprising to be honest. One wouldn’t expect the FOV to change with a change in fps. While fps is simply the number of frames recorded in a video, FOV is a completely different concept that just governs the amount of area a camera captures. They are theoretically supposed to be independent of each other. However, for some drones, changing the fps at which you are recording a video changes the FOV.
Take the AIR2S ³ ⁴ for example. Changing the fps while recording a video will change FOVs of the resultant video. The higher the fps the more frames the drone is recording and processing. If recording a high quality video at 4K resolution, then the drone might lack computational power to process all the frames at the same time. As a fix, the drone crops the video to reduce the data/information that the drone is processing. Because the video is cropped, the FOV is also reduced.
In conclusion, there are many practical nuances that one might overlook when focusing on the theoretical aspects of a problem. Listed in this blogpost are just some of the factors that affect the FOV when working with drones. Building equations to model how all the above factors influence FOVs could be a fascinating challenge. Creating a parametric equation would be a valuable tool for effective estimation of FOVs, but it would require significant effort to collect data across all the setting combinations via multiple field experiments for a number of different drones. Moreover, there would be a minor problem where for every combination of camera settings that one employs- one would need to understand how that setting exactly affects the FOV, which might not be a trivial exercise. Therefore, instead of taking that route, we are only trying to measure the FOVs for the settings which we commonly employ during field work. We have worked on some interesting field work and experiments for this – and look forward to discussing this soon!
REFERENCES
1- Energy-Aware Dynamic 3D placement of Multi-Drone Sensing fleet. Sensors, 21(8), 2622. https://doi.org/10.3390/s21082622 , Luo, Y., & Chen, Y. (2021)
2- DJI Mavic 2 Pro 4K HQ vs Full FOV EXPLAINED + TESTS
3- DJI Air 2S video crop at high fps?
4- Air 2S FOV - 4K 30FPS VS. 4K 60FPS
And some Helpful Links