Iphone + GPT-4 Vision API = Autonomous Security Cam System

preview_player
Показать описание
Iphone + GPT-4 Vision API = Autonomous Security Cam System

👊 Become a member and get access to GitHub:

Get a FREE 45+ ChatGPT Prompts PDF here:
📧 Join the newsletter:

🌐 My website:

I create a autonomous system with an Iphone and the new GPT-4 Vision API that logs and updates changes in the images taken by the Iphone. The system sends an update log to the user via e-mail.

00:00 Iphone GPT-4 Vison Intro
00:14 Iphone Setup
00:33 Iphone GPT-4 Vision Flowchart
01:37 Python Code
05:08 Running the Iphone GPT4V System
Рекомендации по теме
Комментарии
Автор

Imagine being a security surveillance guard watching this. The amount of jobs that is going out soon is crazy

NextGenart
Автор

Cool idea well executed. Those "Let's go back and look at last week's CCTV" from TV shows are going to look very quaint

yoagcur
Автор

Can you please provide the code so we can test ourselves. Thanks!

Zenthara
Автор

Imagine the possibilities of doing something multimodal like this with a super fine-tuned local LLM where you can compare images generated every second.

asastudios
Автор

Comparing image to image is a good idea.. I had it describe the initial image, and used the description with prompting to check for differences. Prompted to look for any issues. Just like telling someone to look out for any unusual activity.

Canna_Science_and_Technology
Автор

Kris, I had my physical AI robot watch this video with me!

I'm sending frames of image data, and telling GPT it's the robot's CAMERA POV. It's such a cool experience to have a video-watching buddy! (He likes your tripod setup!)

geekymonkey
Автор

Great demo. I was thinking about something similar whereby you compare images to a ring doorbell output: So when new image detected compare to my known stored images (wife, kids etc), then tells me who is at door. Also if not in my known images then categorize (postman, deliveroo, charity collector, unknown) etc..

steveatkinson
Автор

This is what ive wanted to work on since i heard vision as a thing

dekumutant
Автор

Great, practical video. Another interesting take on a system like this could be one where the snapshots are triggered by motion instead of (only) time. This would probably be more power-consuming, especially for an iPhone, but thinking of existing off-the-grid systems like wildlife and game trail cameras, it seems possible using some different devices.

lauridskristensen
Автор

Very good . Thanks so much i really liked this beautiful and simple app using AI. You can carry out numerous apps using AI .

ziadnahdi
Автор

How are you taking images with the iPhone every 5 minutes and how do you automatically upload them to Google Drive are you using some sort of iOS Shortcut or?

oaklyfoundation
Автор

What country do you live in? It's amazing look 😍

primobilingue
Автор

I’m more interested to know how much does the open API cost for doing 1 hour of analysis ?

adamchan
Автор

Do you want to share your code
I’m interested how to combine / blend google drive + gpt or how gpt get access to my google drive

cucciolo
Автор

Interesting idea. Really nice seeing it can be used like that. Not sure this is legal in EU though.

funnelfpv
Автор

I wouldnt really show the location that you live in, someone skilled could most certainly dox you from this. Be safe man <3

oaklyfoundation
Автор

📝 Summary of Key Points:

📌 The video demonstrates how to use an iPhone and the GPT 4 Vision API as a security camera. It involves taking images every 5 minutes, uploading them to a Google Drive account, and using the Google Drive API to download the latest images.
🧐 The downloaded images are processed using the GPT 4 Vision API to compare them and detect changes. The changes are logged and summarized using the GPT 4 API, and an email report is generated using the Mailgun API.
🚀 The Python code for the project includes functions for authentication, downloading the latest image, analyzing image comparisons, and sending emails. A loop runs every 350 seconds to check for new images and perform the necessary comparisons and logging.

💡 Additional Insights and Observations:

💬 "The project worked as intended and the author was satisfied with the results."
📊 No specific data or statistics were mentioned in the video.
🌐 The video did not reference any external sources or references.

📣 Concluding Remarks:

The video provides a step-by-step guide on using an iPhone and the GPT 4 Vision API as a security camera. By leveraging the Google Drive API, GPT 4 Vision API, and Mailgun API, the project successfully captures and analyzes images, detects changes, and generates email reports. The Python code provided in the video allows for easy implementation of this security camera setup.

Made with Talkbud

abdelhaibouaicha
Автор

Nice. Love the way a combo of natural language and Python + APIs works so well. Would be horrendously expensive though to send one photo a minute which is what you really need to make it work as a CCTV

bitcoinisfreedommoney.fckt