Friday, January 2, 2026

 Vision-LLMs versus specialized agents 

When we review how vision systems behave in the wild, “using a vision‑LLM for everything” versus “treating vision‑LLMs as just one agent alongside dedicated image models” turns out to be a question about where we want to put our brittleness. Do we want it hidden inside a single gigantic model whose internals we cannot easily control, or do we want it at the seams between specialized components that an agent can orchestrate and debug? 

The recent surveys of vision‑language models are surprisingly frank about this. Large vision‑language models get their power from three things: enormous image–text datasets, exceptionally large backbones, and task‑agnostic pretraining objectives that encourage broad generalization (seunghan96.github.io). In zero‑shot mode, these models can match or even beat many supervised baselines on image classification across a dozen benchmarks, and they now show non‑trivial zero‑shot performance on dense tasks like object detection and semantic segmentation when pretraining includes region–word matching or similar local objectives (seunghan96.github.io). In other words, if all we do is drop in a strong vision‑LLM and ask it to describe scenes, label objects, or answer questions about aerial images, we already get a surprisingly competent analyst “for free,” especially for high‑level semantics.

But the same survey highlights the trade‑off we feel immediately in drone analytics: performance tends to saturate, and further scaling does not automatically fix domain gaps or fine‑grained errors (seunghan96.github.io). When these models are evaluated outside their comfort zone—novel domains, new imaging conditions, or tasks that demand precise localization—their accuracy falls off faster than that of a well‑trained task‑specific network. A broader multimodal LLM review echoes this: multimodal LLMs excel at flexible understanding across tasks and modalities, but they lag behind specialized models on narrow, high‑precision benchmarks, especially in vision and medical imaging (arXiv.org). This is exactly the tension in aerial imagery: a general vision‑LLM can tell us that a scene “looks like a suburban residential area with some commercial buildings and parking lots,” but a dedicated segmentation network will be more reliable at saying “roof area above pitch threshold within this parcel is 183.2 m², confidence 0.93.”

On the other side of the comparison, there is now a growing body of work on “vision‑language‑action” models and generalist agents that explicitly measures how well large models generalize relative to more modular, tool‑driven setups. MultiNet v1.0, for example, evaluates generalist multimodal agents across visual grounding, spatial reasoning, tool use, physical commonsense, multi‑agent coordination, and continuous control (arXiv.org). The authors find that even frontier‑scale models with vision and action interfaces show substantial degradation when moved to unseen domains or new modality combinations, including instability in output formats and catastrophic performance drops under certain domain shifts (arXiv.org). In plain language: the dream of a single, monolithic, generalist model that robustly handles every visual task and every environment is not realized yet, and the gaps become painfully visible once we stress the system.

From an agentic retrieval perspective, this is a compelling argument for bringing dedicated image processing and task‑specific networks back into the loop. Instead of asking a single vision‑LLM to do detection, tracking, segmentation, change detection, and risk scoring directly in its latent space, we let it orchestrate a collection of specialized tools: one network for building footprint extraction, one for vehicle detection, one for surface material classification, one for elevation or shadow‑based height estimation, and so on. The vision‑LLM (or a leaner controller model) becomes an agent that decides which tool to call, with what parameters, and how to reconcile the outputs into a coherent answer or mission plan. This aligns with the broader observation from MultiNet that explicit tool use and modularity are key to robust behavior across domains, because the agent can offload heavy perception and niche reasoning to components that are engineered and validated for those tasks (arXiv.org).
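A minimal sketch of that orchestration loop, with every tool stubbed out and a hand‑written planner standing in for the vision‑LLM's tool‑calling step (in production the plan would come from the model's function‑calling output; the tool names and return fields are invented for illustration):

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, List

@dataclass
class ToolCall:
    name: str
    params: Dict[str, Any]

# Registry of specialized, independently validated vision tools (stubs here).
TOOLS: Dict[str, Callable[..., Dict[str, Any]]] = {
    "building_footprints": lambda tile: {"footprints": [], "source": "seg_net_v4"},
    "vehicle_detector":    lambda tile: {"boxes": [], "precision_at_iou50": 0.91},
    "surface_classifier":  lambda tile: {"materials": {"metal": 0.62}},
    "height_estimator":    lambda tile: {"heights_m": {"rooftop_17": 6.4}},
}

def controller_plan(question: str) -> List[ToolCall]:
    """Stand-in for the vision-LLM controller: maps a question to tool calls.
    A production controller would emit this plan via function calling."""
    if "roof" in question.lower():
        return [ToolCall("building_footprints", {}), ToolCall("height_estimator", {})]
    return [ToolCall("vehicle_detector", {})]

def run(question: str, tile: Any) -> Dict[str, Any]:
    # Execute the plan and collect per-tool outputs; the controller would
    # then reconcile these into a single answer or mission plan.
    return {c.name: TOOLS[c.name](tile, **c.params) for c in controller_plan(question)}

print(run("Which roofs exceed the pitch threshold?", tile=None))
```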

Effectiveness‑wise, the comparison then looks like this. A pure vision‑LLM pipeline gives us extraordinary flexibility and simplicity of integration: we can go from raw imagery to rich natural‑language descriptions and approximate analytics with minimal bespoke engineering. Zero‑shot and few‑shot capabilities mean we can prototype new aerial analytics tasks—like ad‑hoc anomaly descriptions or narrative summaries of inspection flights—without datasets or labels, a point strongly backed by the VLM performance survey (seunghan96.github.io). And because everything lives in one model, latency and deployment can be straightforward: one model call per image or per scene, with a lightweight retrieval step for context.

However, as soon as we require stable performance curves—ROC metrics that matter for compliance, consistent IoU thresholds on segmentation, or repeatable change detection across time and geography—dedicated networks win on raw accuracy and controllability, especially once they are trained or fine‑tuned on our domain. The multimodal LLM review notes that task‑specific models routinely outperform generalist multimodal ones on specialized benchmarks, even when the latter are far larger (arXiv.org). This is amplified in aerial imagery, where label taxonomies, sensor modalities, and environmental conditions can be tightly specified. In an agentic retrieval system, we can treat these specialized models as tools whose failure modes we understand: we know their precision/recall trade‑offs, calibration curves, and domain of validity. The agent can then combine their outputs, cross‑check inconsistencies, and, crucially, abstain or ask for more data when the tools disagree.
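A hedged sketch of that cross‑check‑and‑abstain behavior: two detectors with known reliability curves vote on the same finding, and the agent abstains when their calibrated probabilities diverge. The reliability bins and the disagreement threshold below are invented for illustration, not taken from any of the cited papers.

```python
# Reliability maps: raw confidence bin -> empirical precision on a
# validation set (values invented for illustration).
RELIABILITY_A = {0.5: 0.55, 0.7: 0.68, 0.9: 0.84}
RELIABILITY_B = {0.5: 0.48, 0.7: 0.72, 0.9: 0.93}

def calibrate(raw_conf, reliability):
    # Snap to the nearest calibration bin and return its empirical precision.
    nearest = min(reliability, key=lambda b: abs(b - raw_conf))
    return reliability[nearest]

def decide(conf_a, conf_b, max_gap=0.15):
    p_a = calibrate(conf_a, RELIABILITY_A)
    p_b = calibrate(conf_b, RELIABILITY_B)
    if abs(p_a - p_b) > max_gap:
        # Tools disagree after calibration: abstain and request more data.
        return {"decision": "abstain", "calibrated": (p_a, p_b)}
    return {"decision": "accept", "p": round((p_a + p_b) / 2, 3)}

print(decide(0.9, 0.9))  # close after calibration -> accept
print(decide(0.9, 0.5))  # wide gap -> abstain
```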

Agentic retrieval also changes how we handle generalization. MultiNet’s results show that generalist agents struggle with cross‑domain transfer when relying solely on their internal representations (arXiv.org). When agents are allowed to call external tools or knowledge bases, performance becomes less about what the core model has memorized and more about how well it can search, select, and integrate external capabilities (arXiv.org). In drone analytics terms, that means an agent can respond to a new city, terrain type, or sensor configuration by switching to the tools that were trained for those conditions (or by falling back to more conservative models), instead of relying on a single vision‑LLM that might be biased toward the imagery distributions it saw in pretraining.
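The switching behavior can be as simple as a validated‑domain registry with a conservative fallback. Everything in this sketch (domain keys, model names) is a made‑up placeholder; a real registry would be backed by validation metadata per domain.

```python
# Map (terrain, sensor) -> a tool validated for that exact domain.
REGISTRY = {
    ("urban", "rgb"):     "detector_urban_rgb_v3",
    ("desert", "rgb"):    "detector_arid_rgb_v1",
    ("urban", "thermal"): "detector_urban_ir_v2",
}
# Generic model tuned for precision over recall, used when no match exists.
CONSERVATIVE_FALLBACK = "detector_generic_highprecision"

def select_tool(terrain: str, sensor: str) -> str:
    return REGISTRY.get((terrain, sensor), CONSERVATIVE_FALLBACK)

print(select_tool("urban", "rgb"))       # domain-matched tool
print(select_tool("alpine", "thermal"))  # unseen domain -> conservative fallback
```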

The cost, of course, is complexity. An agentic retrieval system with dedicated vision tools needs orchestration logic, tool schemas, monitoring, and evaluation at the system level. Debugging is about tracing failures across multiple components. But that complexity buys us options. We can, for instance, start with dedicated detectors and segmenters that populate a structured scenes catalog, and only then let a vision‑LLM sit on top to provide natural‑language querying, explanation, and hypothesis generation—an architecture that mirrors how many NL2SQL and visual analytics agents are evolving in other domains. Over time, we can swap in better detectors or more efficient segmenters without changing the higher‑level analytics or the user‑facing experience. 
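As a concrete miniature of that layered architecture, the sketch below uses an in‑memory SQLite table as the structured scenes catalog that detectors would populate, with the language layer reduced to one explicit SQL query on top. The schema and rows are invented for illustration.

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.execute("""
    CREATE TABLE detections (
        scene_id TEXT, ts TEXT, cls TEXT,
        conf REAL, x REAL, y REAL, area_m2 REAL
    )
""")
# In production these rows come from the dedicated detectors/segmenters.
con.executemany(
    "INSERT INTO detections VALUES (?, ?, ?, ?, ?, ?, ?)",
    [
        ("s1", "2025-12-30T10:00Z", "building", 0.93, 10.0, 20.0, 183.2),
        ("s1", "2025-12-30T10:00Z", "vehicle",  0.88, 14.2, 21.1, 8.9),
    ],
)

# The vision-LLM layer sits on top, emitting structured queries like this
# one instead of answering from pixels directly.
rows = con.execute(
    "SELECT scene_id, area_m2 FROM detections WHERE cls='building' AND conf>0.9"
).fetchall()
print(rows)
```

Because the catalog is the stable interface, we can swap in better detectors without touching the query layer or the user‑facing experience.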

Looking at upcoming research, both surveys argue that the field is converging toward hybrid architectures rather than “LLM‑only” systems. The vision‑language survey highlights knowledge distillation and transfer learning as ways to compress VLM knowledge into smaller task‑specific models and suggests that future systems will blend strong generalist backbones with specialized heads or adapters for critical tasks (seunghan96.github.io). The multimodal LLM review calls out tool use, modular reasoning, and better interfaces between multimodal cores and external models as key directions, precisely to address the performance gaps on specialized tasks and the brittleness under domain shift (arXiv.org). MultiNet provides a standardized way to evaluate such generalist‑plus‑tools agents, making it easier to quantify when adding dedicated components improves robustness versus just adding engineering overhead (arXiv.org).

For aerial drone imagery, this points to a clear strategic posture. Vision‑LLMs used exclusively are invaluable for rapid prototyping, interactive exploration, and semantic understanding at the human interface layer. They dramatically lower the cost of asking new questions about our imagery. Dedicated image processing and neural networks, when wrapped as tools inside an agentic retrieval framework, are what we reach for when correctness, repeatability, and scale become non‑negotiable. The most effective systems will not choose one or the other, but will treat the vision‑LLM as an intelligent conductor directing a small orchestra of specialist models—precisely because the current generation of generalist models, impressive as they are, still falls short of being consistently trustworthy across the full range of drone analytics tasks we actually care about (seunghan96.github.io; arXiv.org).


Thursday, January 1, 2026

 

This is a summary of the book titled “Intentional Leadership: The Big 8 Capabilities Setting Leaders Apart,” written by Rose M. Patten, a Canadian businesswoman and philanthropist, and published by University of Toronto Press in 2023. She discusses what truly sets effective leaders apart, especially in times of adversity. Drawing from her extensive experience and the rigorous debates held at Toronto’s Rotman School of Management and the BMO Executive Leadership Programs, Patten introduces readers to her framework of the “Big 8” leadership capabilities. These eight qualities—adaptability, strategic agility, self-renewal, character, empathy, communication, collaboration, and developing other leaders—are not just theoretical ideals but practical skills that leaders must cultivate intentionally if they wish to thrive in today’s volatile environment.

Patten’s journey into the heart of leadership began with a simple but profound observation: critical challenges, whether global crises like the 2008 financial meltdown or the COVID-19 pandemic, or more localized emergencies, have the power to forge stronger leaders. She notes that few organizations proactively consider how turbulent change will impact their senior executives, yet it is often those leaders who have been tempered by crisis who step forward to reshape their organizations. The aftermath of upheaval, Patten argues, is a defining moment for leaders—a time to reflect on their actions under pressure and to extract lessons that fuel personal and professional growth.

Leadership, according to Patten, is not a static trait but a dynamic process shaped by context. She identifies three “game changers” that continually affect leadership: stakeholder demands, the evolving workforce, and the need for rapidly changing strategies. Boards of directors, once focused solely on strategy, have shifted their attention to ethical considerations and, more recently, to the agility of leaders in adapting strategies to meet new circumstances. Patten emphasizes that leadership must be prepared for and responsive to a constant sense of urgency.

However, Patten warns that several persistent fallacies make adaptability and rapid change more difficult for leaders. Many believe, without evidence, that leadership ability is constant, that soft skills naturally improve over time, that top performers will automatically become great leaders, and that only junior executives need mentors. These misconceptions, she argues, hinder the development of essential leadership capabilities. Instead, Patten insists that leadership is learned and strengthened through lifelong learning, and that leaders must be willing to change their perceptions and relinquish even long-held points of view.

The book draws on insights from experts like Janice Gross Stein, who distinguishes between change within a familiar context and change that requires leaders to adapt to dramatically altered circumstances. The COVID-19 pandemic, for example, forced leaders to abandon hopes of returning to “normal” and instead prepare for unprecedented challenges. Patten stresses that time spent in a leadership role does not automatically improve soft skills; deliberate prioritization and self-awareness are required. She cites research showing that self-aware leaders are up to four times more likely to succeed than those who lack this quality.

Mentoring, too, is a vital but often overlooked aspect of leadership development. While many senior leaders believe they no longer need mentoring, Patten reveals that nearly 80% of CEOs regularly seek advice from mentors, even if they do not label these relationships as such. Mentors help leaders confront hidden strengths and weaknesses, fostering introspection and growth. The economic crisis of 2008 marked a turning point, prompting organizations to invest more in the development of their top executives through both classroom and on-the-job training.

Adaptability enables leaders to respond to new challenges without being paralyzed by old habits. Strategic agility requires an open mind and the willingness to discard outdated strategies. Self-renewal is fueled by self-assessment and feedback, while character is built through the conscious pursuit of trust and transparency. Empathy, rooted in core values, shapes the atmosphere of an organization, and contextual communication ensures that leaders explain not just the “what” but the “why” behind decisions. Spirited collaboration encourages leaders to share leadership and foster inclusivity, and developing other leaders is essential for organizational resilience.

Patten argues that talent development is perhaps the most vital of the Big 8 capabilities. Despite its importance, many organizations invest more in technical skills than in developing leadership talent, resulting in a shortage of capable leaders. The Big 8 framework is not a checklist but an interconnected set of qualities that overlap and reinforce each other as leaders work together to achieve organizational goals. Intentional leadership requires courage, self-awareness, and a commitment to lifelong learning. Leaders who embrace these principles are better equipped to navigate uncertainty, inspire their teams, and leave a lasting impact.

#codingexercise: CodingExercise-01-01-2026.docx

Wednesday, December 31, 2025

This is a summary of a book titled “Developing the Leader Within You 2.0” written by John C. Maxwell and published by HarperCollins in 2018. In this book, he explores the essential qualities and practices that define effective leadership, drawing on decades of experience and a wealth of illustrative case histories. He starts by saying that leadership is not merely a matter of position or seniority, nor is it an innate trait reserved for a select few. Instead, he argues, leadership is a set of skills and character traits that anyone can develop through intentional effort and self-reflection. He emphasizes that the journey to becoming a great leader is transformative, promising to enhance effectiveness, reduce weaknesses, lighten workloads, and multiply one’s impact on others.

Maxwell acknowledges that many potential leaders hesitate to pursue growth, often held back by limiting beliefs. Some may think they are not “born leaders,” or that a title or years of experience will automatically confer leadership status. Others postpone their development, waiting for an official appointment before investing in themselves. Maxwell counters these misconceptions with the wisdom of John Wooden, who cautioned that preparation must precede opportunity. The message is clear: leadership development is a proactive endeavor, and the time to start is now.

He asserts that effective leadership rests on the mastery of ten fundamental capabilities. The first is influence, which he describes as the cornerstone of leadership. Influence is earned through respect and manifests in various forms, from positional authority to the ability to inspire and develop others. Maxwell illustrates the five levels of leadership, ranging from the basic authority of a position to the pinnacle of influence achieved through personal excellence and the development of others. He shares personal anecdotes, such as the lasting impact of a teacher’s encouragement, to demonstrate how influence can ripple through countless lives. Maxwell’s mantra, “Leadership is influence,” underscores the importance of cultivating authentic authority.

Judgment is the second capability, and Maxwell reframes time management as the art of setting priorities. Everyone receives the same twenty-four hours each day, but leaders distinguish themselves by choosing how to spend that time wisely. He encourages self-analysis to identify what matters most, advocating for proactive decision-making and the mature acceptance that not everything can be accomplished. Prioritization, he suggests, is the key to productivity and fulfillment.

Character forms the ethical foundation of leadership. Maxwell notes that leading oneself is often the greatest challenge, requiring ongoing self-examination and the courage to reshape one’s own behavior. He draws on the example of Pope Francis, who warns leaders to avoid common pitfalls such as arrogance, busyness, inflexibility, and lack of gratitude. Authenticity, humility, and gratitude are vital, while rivalry, hypocrisy, and indifference erode trust and effectiveness.

Change management is another critical skill. Maxwell recounts the story of Lou Holtz, a football coach who transformed losing teams into champions by embracing change and inspiring others to do the same. Change, Maxwell observes, is often accompanied by emotional turmoil and resistance, but leaders must help others see the benefits that outweigh the losses. The ability to guide teams through transitions is a hallmark of agile leadership.

Problem-solving is presented as an opportunity rather than a burden. Maxwell cites M. Scott Peck’s insight that accepting life’s difficulties makes them easier to overcome. Leaders, he notes, are perpetually navigating crises, and their effectiveness depends on viewing challenges as chances for growth and innovation.

Attitude is another defining trait. Maxwell highlights the importance of positivity, tenacity, and hope, noting that followers often mirror the disposition of their leaders. He quotes Charles Swindoll, who places attitude above education, wealth, and circumstance. A leader’s outlook shapes the culture and morale of the entire team.

Servant leadership is a core value for Maxwell, shaped by his own journey as a church pastor. Initially focused on personal achievement, he was transformed by the philosophy of Zig Ziglar, who taught that helping others achieve their goals leads to mutual success. Maxwell now champions the idea that serving others is the essence of true leadership.

Vision is essential for providing teams with purpose and direction. Without vision, Maxwell warns, teams lose energy and focus, becoming fragmented and disengaged. A leader’s ability to articulate a compelling future inspires commitment and elevates ordinary work to extraordinary levels.

Self-control is the discipline required to lead oneself before leading others. Maxwell invokes Harry S. Truman’s belief that self-mastery is the first victory. Leaders must travel inward, cultivating self-discipline, because followers will not trust someone who lacks control.

Personal growth is the ongoing process of expanding one’s abilities and expertise. Maxwell shares his tradition of reflecting on lessons learned at each decade of life, emphasizing that growth requires a willingness to surrender comfort and embrace change. The pursuit of personal development leads to greater influence, decisiveness, discipline, and positivity, ultimately shaping a more complete leader and person.

Throughout this book, Maxwell weaves together practical advice, personal stories, and timeless wisdom to create a compelling guide for anyone seeking to unlock their leadership potential. The book’s message is both empowering and challenging: leadership is within reach for those willing to invest in themselves, embrace growth, and serve others. By mastering these ten capabilities, individuals can transform not only their own lives but also the lives of those they lead.


Tuesday, December 30, 2025

Vision‑LLM versus agentic retrieval: Which is better?

In aerial drone image analytics, vision‑LLMs and agentic retrieval are starting to look less like competing paradigms and more like different points along the same design spectrum: how much of our “intelligence” lives in a single multimodal model, and how much is distributed across specialized tools that the model orchestrates. The most recent geospatial benchmarks make that trade‑off very concrete.

Geo3DVQA is a good anchor for understanding what raw vision‑LLMs can and cannot do for remote sensing. It evaluates ten state‑of‑the‑art vision‑language models on 3D geospatial reasoning tasks using only RGB aerial imagery—no LiDAR, no multispectral inputs, just the kind of data we get at scale (arXiv.org). The benchmark spans 110k question–answer pairs across 16 task categories and three levels of complexity, from single‑feature questions (“What is the dominant land cover here?”) to multi‑feature reasoning (“Are the taller buildings concentrated closer to the river?”) and application‑level spatial analysis (“Is this neighborhood at high risk for heat‑island effects?”) (arXiv.org). When we look at the performance, the story is sobering. General‑purpose frontier models like GPT‑4o and Gemini‑2.5‑Flash manage only 28.6% and 33.0% accuracy respectively on this benchmark (arXiv.org). A domain‑adapted Qwen2.5‑VL‑7B, fine‑tuned on geospatial data, jumped to 49.6%, gaining 24.8 percentage points over its base configuration (arXiv.org). That’s a big relative gain, but it’s still far from the kind of reliability we want if the output is going to drive asset inspections, risk scoring, or regulatory reporting.

Those numbers capture the core reality of pure vision‑LLM usage in drone analytics today. If our task is open‑ended visual understanding—describing scenes, answering flexible questions, triaging imagery, or accelerating human review—these models already add real value. They compress rich spatial structure into text in a way that is incredibly convenient for analysts and downstream systems. But when the task requires precise, height‑aware reasoning, consistent semantics across large areas, or application‑grade spatial analysis, even the best general models underperform without heavy domain adaptation (arXiv.org). In other words, “just ask the VLM” is powerful for exploration but fragile for anything that must be consistently correct at scale.

Agentic retrieval frameworks approach the same problem from the opposite direction. Instead of relying on a single, monolithic vision‑LLM to do perception, memory, and planning all at once, they treat the model as one decision‑making component in a multi‑agent system—one that can call out to external tools, databases, and specialized models when needed. UAV‑CodeAgents is a clear example in the UAV domain. It uses a ReAct‑style architecture where multiple agents collaboratively interpret satellite imagery and high‑level natural language instructions, then generate executable UAV missions (arXiv.org). The system includes a vision‑grounded pixel‑pointing mechanism that lets the agents refer to precise locations on the map, and a reactive thinking loop so they can iteratively revise goals as new observations arrive (arXiv.org). In large‑scale mission planning scenarios for industrial and environmental fire detection, UAV‑CodeAgents achieves a 93% mission success rate, with an average mission creation time of 96.96 seconds (arXiv.org). The authors show that lowering the decoding temperature to 0.5 improves planning reliability and reduces execution time, and that fine‑tuning Qwen2.5‑VL‑7B on 9,000 annotated satellite images strengthens spatial grounding (arXiv.org).
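The ReAct pattern at the heart of such systems is easy to show in miniature. The sketch below is a generic reconstruction, not the UAV‑CodeAgents implementation: the reason and act steps are stubs where a real system would call a VLM (at low decoding temperature) and grounded pixel‑pointing and mission tools.

```python
def reason(observation: dict, goal: str) -> dict:
    """Stub for the VLM 'thought' step: choose the next action."""
    if observation.get("fire_located"):
        return {"action": "emit_mission", "waypoint": observation["pixel"]}
    return {"action": "point_at_pixel", "query": goal}

def act(action: dict) -> dict:
    """Stub tool executor: grounds actions against imagery / mission APIs."""
    if action["action"] == "point_at_pixel":
        return {"fire_located": True, "pixel": (512, 284)}  # mocked grounding
    return {"mission": {"waypoints": [action["waypoint"]], "status": "planned"}}

observation, goal = {}, "locate the industrial fire and plan an overflight"
for _ in range(5):  # bounded reactive thinking loop
    action = reason(observation, goal)
    observation = act(action)
    if "mission" in observation:  # stop once a mission plan is emitted
        break
print(observation)
```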

What’s striking here is that the system’s effectiveness comes from the interplay between the vision‑LLM and the agentic scaffold around it. The VLM is not directly “flying the drone” or making all decisions. Instead, it interprets images, reasons in language, and chooses when to act—e.g., calling tools, updating waypoints, or revising mission plans (arXiv.org). The agentic layer enforces structure: we have explicit mission goals, world representation, constraints, and action APIs. As a result, the same underlying multimodal model that might only reach 30–50% accuracy on a free‑form VQA benchmark can, when harnessed in this way, support end‑to‑end mission plans that succeed more than 90% of the time in the evaluated scenarios (arXiv.org). The retrieval part—pulling in maps, prior detections, environmental context, or historical missions—is implicit in that architecture: the agents are constantly grounding their decisions in external data sources rather than relying solely on the VLM’s internal weights.

If we put Geo3DVQA and UAV‑CodeAgents side by side, we get a quantitative feel for the trade‑off. Raw vision‑LLMs, even frontier‑scale ones, struggle to exceed 30–33% accuracy on complex 3D geospatial reasoning with RGB imagery, whereas a domain‑adapted 7B model can reach roughly 50% (arXiv.org). That’s good enough for “co‑pilot”‑style assistance but not for autonomous decision making. Meanwhile, an agentic system that embeds a comparable VLM inside a multi‑agent ReAct framework, and couples it to grounded tools and explicit mission representations, can deliver around 93% mission success in its target domain, with sub‑two‑minute planning times (arXiv.org). The exact numbers are not directly comparable—Geo3DVQA is a question‑answer benchmark, UAV‑CodeAgents is mission generation—but they point in the same direction: the more we offload structure, memory, and control to an agentic retrieval layer, the more we can extract robust, end‑to‑end performance from imperfect vision‑LLMs.

For aerial drone image analytics specifically—change detection, object‑of‑interest search, compliance checks, risk scoring—the practical implications are clear. A pure vision‑LLM approach is ideal when we want to sit an analyst in front of a scene and let them ask free‑form questions: “What seems unusual here?”, “Where are the access points?”, “Which rooftops look suitable for solar?” The model’s strengths in semantic abstraction and natural language reasoning shine in those settings, and benchmarks like Geo3DVQA suggest that domain‑tuned models will keep getting better (arXiv.org). But as soon as we care about consistency across thousands of scenes, strict thresholds, or compositional queries over time and space, we want those questions to be mediated by an agentic retrieval system that explicitly tracks objects, events, geospatial layers, and past decisions. In that world, the vision‑LLM is mostly a perception‑and‑intent module: it turns raw pixels and human queries into structured facts and goals, which the agents then reconcile against a retrieval layer made of maps, catalogs, and traditional analytics.
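To illustrate that “perception‑and‑intent module” framing, here is a minimal sketch in which the VLM's only job is to fill a structured schema of facts and goals for downstream agents. The Fact/Intent schema and the parser stub are assumptions for illustration, not an established interface; a real system would have the VLM fill the schema via constrained or function‑calling decoding.

```python
from dataclasses import dataclass, field

@dataclass
class Fact:
    subject: str
    predicate: str
    value: object

@dataclass
class Intent:
    goal: str
    facts: list = field(default_factory=list)
    constraints: dict = field(default_factory=dict)

def parse_scene(query: str) -> Intent:
    """Stub for the VLM perception-and-intent step: emits structured output
    that downstream agents reconcile against catalogs and maps."""
    return Intent(
        goal="change_detection",
        facts=[Fact("rooftop_17", "material", "metal"),
               Fact("rooftop_17", "area_m2", 183.2)],
        constraints={"aoi": "parcel_0042", "since": "2025-11-01"},
    )

intent = parse_scene("Has anything changed on the metal rooftops since November?")
print(intent.goal, intent.constraints)
```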

The research frontier is moving in two complementary directions. On the vision‑LLM side, Geo3DVQA highlights the need for models that can infer 3D structure and environmental attributes from RGB alone and shows that domain‑specific fine‑tuning can double performance relative to general models (arXiv.org). We can expect a wave of remote‑sensing‑tuned VLMs that push accuracy beyond 50% on multi‑step geospatial reasoning tasks and start to integrate external cues like DEMs, climate data, and building footprints in more principled ways. On the agentic retrieval side, UAV‑CodeAgents demonstrates that multi‑agent ReAct frameworks, with explicit grounding and tool calls, can already achieve high mission success in constrained scenarios (arXiv.org). The next step is to standardize benchmarks for these systems: not just asking whether the VLM answered the question correctly, but whether the full agentic pipeline produced safe, efficient, and explainable decisions on real drone missions.

What is missing—and where there is room for genuinely new work—is a unified evaluation that holds everything constant except the degree of “agentic scaffolding.” Imagine taking the same aerial datasets, the same base VLM, and comparing three regimes: the VLM answering questions directly; the VLM augmented with retrieval over a geospatial database but no explicit agency; and a fully agentic, multi‑tool system that uses the VLM only as a reasoning and perception kernel. We could measure not only accuracy and latency, but also mission success, human trust, error recoverability, and the ease with which analysts can audit and refine decisions. Geo3DVQA provides the template for rigorous perception‑level benchmarking (arXiv.org); UAV‑CodeAgents sketches how to evaluate mission‑level performance in an agentic system (arXiv.org). The next wave of work will connect those two levels, and the most interesting findings will not be “VLMs versus agentic retrieval,” but how to architect their combination so that drone analytics pipelines are both more powerful and more controllable than either paradigm alone.
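A tiny harness sketch of that three‑regime comparison. Every regime and metric is stubbed out—the regime names, scorer, and tool‑call counts are illustrative placeholders—the point is only that the harness holds data, base model, and scoring constant while the scaffolding varies.

```python
# Stub regimes: same base VLM assumed underneath, different scaffolding.
def vlm_direct(item):         return {"answer": "...", "tool_calls": 0}
def vlm_plus_retrieval(item): return {"answer": "...", "tool_calls": 1}
def full_agent(item):         return {"answer": "...", "tool_calls": 3}

REGIMES = {"direct": vlm_direct, "retrieval": vlm_plus_retrieval, "agentic": full_agent}

def evaluate(dataset, scorer):
    report = {}
    for name, regime in REGIMES.items():
        outputs = [regime(item) for item in dataset]
        report[name] = {
            "accuracy": sum(scorer(o) for o in outputs) / len(dataset),
            "mean_tool_calls": sum(o["tool_calls"] for o in outputs) / len(dataset),
        }
    return report

print(evaluate(dataset=[{}, {}], scorer=lambda o: 0.0))  # placeholder scorer
```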


Monday, December 29, 2025

Vision‑LLM chat interface versus objects‑in‑scenes catalog plus SQL: which is better?

The clearest quantitative comparison between a language-model-based querying interface and a traditional SQL workflow comes from Ipeirotis and Zheng’s 2025 user study on natural language interfaces for databases (NLIDBs). They compare SQL‑LLM, a modern NL2SQL system built on Seek AI, with Snowflake’s native SQL interface in a controlled lab setting with 20 participants and 12 realistic analytics tasks per participant. The results are surprisingly decisive: the NL2SQL interface reduces mean task completion time from 629 seconds to 418 seconds, a 10–30% speedup depending on task, with a statistically significant difference (p = 0.036). At the same time, task accuracy rises from 50% to 75% (p = 0.002). Participants also reformulate queries less often, recover from errors 30–40 seconds faster, and report lower frustration. Behavioral analysis shows that, when the NLIDB is well‑designed, users actually adopt more structured, schema‑aware querying strategies over time, rather than treating the system as a vague natural language oracle.

If this is mapped to the data analytics world, SQL‑LLM is essentially “LLM chat front‑end that emits structured queries”; Snowflake is the canonical structured interface. So, at least in the textual domain, a chat interface tightly coupled to a correct, inspectable execution layer can be both faster and more accurate than a traditional SQL UI for mixed‑skill users. The result is not just that “chat is nicer,” but that it materially shifts the error profile: users spend less time fighting syntax and more time converging on the right question.
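The interaction pattern is easy to sketch: a translator turns the question into SQL, a conventional engine executes it, and the SQL itself stays visible for auditing. The translator below is a stub (nothing here reproduces Seek AI's prompting), with an in‑memory SQLite database standing in for Snowflake.

```python
import sqlite3

def nl_to_sql(question: str) -> str:
    """Stub NL2SQL step; in production an LLM generates this, and the
    generated SQL is surfaced to the user alongside the results."""
    return "SELECT district, COUNT(*) FROM incidents GROUP BY district"

con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE incidents (district TEXT, ts TEXT)")
con.executemany(
    "INSERT INTO incidents VALUES (?, ?)",
    [("north", "2025-12-01"), ("north", "2025-12-02"), ("south", "2025-12-02")],
)

sql = nl_to_sql("How many incidents per district?")
print(sql)                           # inspectable: the user can audit the query
print(con.execute(sql).fetchall())   # executed by the correct, deterministic layer
```

Because the chat layer only ever emits structured queries, errors show up as wrong (but inspectable) SQL rather than hallucinated result tables—which is exactly the error-profile shift the study measures.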

On the visual analytics side, Martins and colleagues provide a 2025 systematic review, “Talking to Data,” which synthesizes the rise of conversational agents for visual analytics and natural-language-to-visualization (NL2VIS) workflows. They survey LLM‑based agents that let users ask questions like “Show me a time series of daily incidents by district and highlight outliers” and receive automatically generated charts and dashboards. Across the systems they review, the primary benefit is consistent: conversational interfaces dramatically lower the barrier to entry for non‑technical users and accelerate first‑insights exploration for everyone. Users no longer need to know which chart type, which field, or which filter to apply; instead, they iteratively describe intent in language. The review notes an acceleration of research after 2022 and highlights common architectural patterns such as multi‑agent reasoning (one agent for intent parsing, another for code generation, another for validation), context‑aware prompting, and automatic code generation backends that produce SQL or visualization scripts under the hood.

But the same review is blunt about the downsides. LLM‑driven visual analytics systems suffer from prompt brittleness, hallucinated insights, and inconsistent performance across domains. In other words, they shine in “getting started” and in ideation but can be fragile in the long tail of complex or ambiguous queries. This is precisely where a structured objects‑in‑scenes catalog plus SQL (or structured filters) tends to dominate: once a user knows what she wants, a faceted object browser with composable filters and explicit SQL conditions is precise, auditable, and predictable. The current research consensus is not that conversational agents replace structured interfaces, but that they act as an outer, more human‑friendly layer wrapped around a rigorous, structured core.

The vision‑specific evidence is still thin, but a consistent pattern is emerging in recent work on LLM‑assisted visual analytics agents. Zhao and colleagues’ ProactiveVA framework implements an LLM‑powered UI agent that monitors user interactions with a visual analytics system and offers context‑aware suggestions proactively, rather than only on demand. Instead of just answering queries, the agent watches when users get “stuck” in complex visual tools and intervenes with suggestions: alternative views, drill‑downs, parameter changes. They implement the agent in two different visual analytics systems and evaluate it through algorithmic evaluation and user and expert studies, showing that proactive assistance can help users navigate complexity more effectively. Although ProactiveVA is not focused purely on vision‑language object querying, it illustrates the same interaction pattern likely to emerge in vision‑LLM settings: the agent lives on top of a rich, structured tool (our object catalog, filters, metrics) and orchestrates interactions, rather than replacing the underlying structure.

If one projects the NLIDB and NL2VIS findings into a vision‑LLM setting where the underlying data is an objects‑in‑scenes catalog indexed by SQL, a few hypotheses are well‑supported by existing evidence, even if not yet directly tested for aerial or scene‑level vision. First, a vision‑LLM chat interface that translates “natural” questions like “Show me all intersections with at least three trucks and a pedestrian within 10 meters in the last 5 minutes” into structured queries over a scene catalog will almost certainly improve accessibility and time‑to‑first‑answer for non‑SQL users, mirroring the 10–30% time savings and 25‑point accuracy gains seen in NLIDB studies. Second, the same studies suggest that, with appropriate feedback—showing the generated SQL, visualizing filters, allowing users to refine them—users begin to internalize the schema and move toward more structured mental models over time, rather than staying in a purely “chatty” mode. Third, NL2VIS work indicates that conversational interfaces excel at exploration, hypothesis generation, and “what’s interesting here?” tasks, while deterministic structured interfaces excel at confirmatory analysis and compliance‑grade reproducibility.
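As a concrete illustration of the first hypothesis, here is roughly what a grounded chat layer might emit for that truck‑and‑pedestrian question. The objects/scenes schema and the naive Euclidean distance predicate are assumptions for illustration (and one reading of an ambiguous spatial question); a production catalog would use proper geospatial functions.

```python
QUESTION = ("Show me all intersections with at least three trucks and a "
            "pedestrian within 10 meters in the last 5 minutes")

# The generated SQL is surfaced to the user for inspection and refinement.
GENERATED_SQL = """
SELECT s.intersection_id
FROM objects t
JOIN objects p
  ON p.scene_id = t.scene_id
 AND p.cls = 'pedestrian'
 AND ((p.x - t.x)*(p.x - t.x) + (p.y - t.y)*(p.y - t.y)) <= 10*10
JOIN scenes s ON s.scene_id = t.scene_id
WHERE t.cls = 'truck'
  AND s.ts >= datetime('now', '-5 minutes')
GROUP BY s.intersection_id
HAVING COUNT(DISTINCT t.object_id) >= 3;
"""
print(GENERATED_SQL)
```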

At the same time, all the pain points NL2VIS and NLIDB researchers describe will be amplified in vision‑LLM workflows. Hallucinations in vision‑language models mean that a chat interface might confidently describe patterns or objects that are not actually present in the underlying catalog, unless the system is architected so that the LLM can only reason over ground‑truth detections and metadata, not raw pixels. Schema ambiguity becomes more complicated, because the same visual concept (say, “truck near crosswalk”) may correspond to multiple object categories, spatial predicates, and temporal windows in the catalog. The review by Martins et al. emphasizes that robust systems increasingly rely on multi‑stage pipelines and explicit grounding: one module to resolve user intent, another to generate executable code, and another to validate results against the data and, if necessary, ask follow‑up questions. That is roughly the architecture we would want for trustworthy vision‑LLM interfaces as well.
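A minimal sketch of that multi‑stage shape—intent resolution, query generation, and a grounding check that triggers a follow‑up question—assuming a toy catalog vocabulary (the class list and stubs are invented for illustration):

```python
KNOWN_CLASSES = {"truck", "pedestrian", "car"}

def resolve_intent(question: str) -> dict:
    # Stage 1: map the question to catalog vocabulary (stubbed keyword match).
    classes = [c for c in KNOWN_CLASSES if c in question.lower()]
    return {"classes": classes, "ambiguous": len(classes) == 0}

def generate_query(intent: dict) -> str:
    # Stage 2: emit explicit, inspectable SQL over the objects catalog.
    cls_list = ", ".join(f"'{c}'" for c in intent["classes"])
    return f"SELECT * FROM objects WHERE cls IN ({cls_list})"

def validate(intent: dict) -> bool:
    # Stage 3: grounding check - refuse concepts the catalog cannot express.
    return not intent["ambiguous"]

def answer(question: str) -> str:
    intent = resolve_intent(question)
    if not validate(intent):
        return "Follow-up: which object classes do you mean?"
    return generate_query(intent)

print(answer("trucks near the crosswalk"))  # grounded -> explicit SQL
print(answer("anything weird here?"))       # ambiguous -> follow-up question
```

The key property is that the LLM can only reason over ground‑truth detections and metadata: anything it cannot ground becomes a clarifying question rather than a confident hallucination.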

Upcoming research directions in the literature line up nicely with the gap we are pointing at. Martins et al. explicitly call for more systematic user studies that compare conversational agents to traditional visual analytics tools, focusing not only on accuracy and time, but also on trust, learnability, and long‑term workflow integration. They highlight the need for standardized benchmarks for conversational visual analytics—essentially the NL2SQL benchmarks, but for NL2VIS and related tasks. ProactiveVA, meanwhile, opens the door to agentic systems that do more than answer questions: they monitor interaction logs, predict when the user needs help, and suggest next steps in an interpretable, controllable way. Extending such agents to vision‑centric workflows, where the agent can propose new filters or views on top of an objects‑in‑scenes catalog, is a natural next step.

What is still missing, and where there is clear space for original work, is an end‑to‑end, quantitative comparison between three modes on the same vision dataset: first, a pure objects‑in‑scenes catalog with SQL or GUI filters; second, a vision‑LLM chat interface that only describes scenes but does not drive structured queries; and third, a hybrid system where the chat interface is grounded in the catalog and always produces explicit, inspectable queries. The database and visual analytics communities have now shown that the hybrid pattern—LLM chat front‑end, structured execution back‑end—can deliver significant gains in speed, accuracy, and user satisfaction over traditional interfaces alone. Vision‑centric systems are just starting to catch up. If we frame our Drone Video Sensing Applications work as “bringing the NLIDB/NL2VIS playbook into multimodal, scene‑level analytics” and design a user study with metrics analogous to Ipeirotis and Zheng’s, we would not just be building a product interface; we would be writing one of the first concrete answers to the question we are asking.