Vision App - a sample iOS app for image tagging and face detection built with IBM Bluemix OpenWhisk

Vision App is a sample iOS application to automatically tag images and detect faces by using IBM visual recognition technologies.

Take a photo or select an existing picture, then let the application generate a list of tags and detect people, buildings, and objects in the picture. Share the results with your network.

Overview

Built using IBM Bluemix, the application uses:

Watson Visual Recognition
OpenWhisk
Cloudant

Architecture diagram (vision_analysis): the app stores the image in Cloudant and calls OpenWhisk; OpenWhisk reads the image from Cloudant, passes it to Watson Visual Recognition, and returns the result to the app.

The application sends the picture to a Cloudant database. Then it calls an OpenWhisk action that will analyze the picture and send back the results of the analysis.
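At the REST level, calling an action and waiting for its result is a `POST` to the action's URL with `blocking=true` and HTTP basic authentication built from the OpenWhisk key and secret. The sketch below only builds such a request and does not send it. The function name, the `apiHost` value, and the `imageDocumentId` parameter are assumptions made for illustration; the sample app may pass different parameters or go through a client SDK instead.

```javascript
// Builds (but does not send) the HTTP request for a blocking action invocation.
// All argument names are illustrative; `params` is whatever the action expects.
function buildInvokeRequest({ apiHost, namespace, action, key, secret, params }) {
  // OpenWhisk uses HTTP basic authentication with "key:secret".
  const credentials = Buffer.from(key + ':' + secret).toString('base64');
  return {
    method: 'POST',
    url: 'https://' + apiHost +
      '/api/v1/namespaces/' + encodeURIComponent(namespace) +
      '/actions/' + encodeURIComponent(action) + '?blocking=true',
    headers: {
      Authorization: 'Basic ' + credentials,
      'Content-Type': 'application/json',
    },
    // Invocation parameters are sent as a JSON object.
    body: JSON.stringify(params),
  };
}

// Example with placeholder values (hypothetical host and document id).
const exampleRequest = buildInvokeRequest({
  apiHost: 'openwhisk.example.com',
  namespace: '_',
  action: 'vision-analysis',
  key: 'my-key',
  secret: 'my-secret',
  params: { imageDocumentId: 'some-document-id' },
});
```

The resulting object can be handed to any HTTP client. Keeping request construction separate from sending makes it easy to inspect the URL and headers while debugging.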

This application is one example use case. With the OpenWhisk action implemented in this example, another use case could be to automatically classify images in a library to improve search capabilities: the same OpenWhisk action, used in a different context. In effect, this action is a microservice for image analysis in the cloud, created without deploying or managing a single server.

Application Requirements

IBM Bluemix account. Sign up for Bluemix, or use an existing account.
IBM Bluemix OpenWhisk early access. Sign up for Bluemix OpenWhisk.
Xcode 8.1, iOS 10, Swift 3.0

Preparing the environment

Get the code

Clone the app to your local environment from your terminal using the following command:
```
git clone https://github.com/IBM-Bluemix/openwhisk-visionapp.git
```
Alternatively, download and extract the source code from this archive.

Create the Bluemix Services

Open the IBM Bluemix console
Create a Cloudant NoSQL DB service instance named cloudant-for-vision
Open the Cloudant service dashboard and create a new database named openwhisk-vision
Create a Watson Visual Recognition service instance named visualrecognition-for-vision

Note: if you have existing instances of these services, you don't need to create new instances. You can simply reuse the existing ones.

Deploy OpenWhisk Actions

Ensure your OpenWhisk command line interface is properly configured with:

wsk list

Create the action using the following command, replacing the placeholders with the credentials obtained from the respective service dashboards in Bluemix:

wsk action create -p cloudantUrl [URL] -p cloudantDbName openwhisk-vision -p watsonApiKey [123] vision-analysis analysis.js
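Parameters bound with `-p` at creation time become default parameters of the action. OpenWhisk merges them with the invocation parameters into the single object passed to the action's `main` function. The following is a minimal sketch of how an action could check that the three parameters from the command above are present. `checkParams` is a hypothetical helper and this `main` is not the one in analysis.js.

```javascript
// Hypothetical helper: returns the names of the bound parameters that are
// missing or empty in the object passed to main().
function checkParams(args) {
  const required = ['cloudantUrl', 'cloudantDbName', 'watsonApiKey'];
  return required.filter((name) => !args || !args[name]);
}

// Sketch of an action entry point (not the actual analysis.js).
function main(args) {
  const missing = checkParams(args);
  if (missing.length > 0) {
    // A result carrying an `error` property is reported as a failed activation.
    return { error: 'Missing parameters: ' + missing.join(', ') };
  }
  return { ok: true };
}
```

If the action fails right after deployment, a check like this makes a mistyped parameter name visible in the activation result instead of surfacing later as a connection error.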

Configure Xcode

To configure the iOS application, you need the credentials of the Cloudant service created above and your OpenWhisk authorization key.

Open vision.xcworkspace with Xcode
Open the file vision/vision/model/ServerlessAPI.swift
Set the value of the constant CloudantUrl to the Cloudant service credentials URL.
Set the value of the constants WhiskAppKey and WhiskAppSecret to your OpenWhisk credentials. You can retrieve them from the iOS SDK configuration page or you can retrieve the key and secret with the following CLI command:

wsk property get --auth

whisk auth kkkkkkkk-kkkk-kkkk-kkkk-kkkkkkkkkkkk:tttttttttttttttttttttttttttttttttttttttttttttttttttttttttttttttt

The strings before and after the colon are your key and secret, respectively.

Save the file
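The key and secret split described above can be sketched as a small function. It takes one line in the format printed by `wsk property get --auth` and returns the two values to paste into WhiskAppKey and WhiskAppSecret. `splitWhiskAuth` is an illustrative name, not part of the project.

```javascript
// Extracts the key and secret from a line such as "whisk auth <key>:<secret>".
function splitWhiskAuth(line) {
  // The credentials are the last whitespace-separated token on the line.
  const token = line.trim().split(/\s+/).pop();
  const index = token.indexOf(':');
  if (index < 0) {
    throw new Error('Expected credentials in the form "key:secret"');
  }
  return {
    key: token.slice(0, index),      // value for WhiskAppKey
    secret: token.slice(index + 1),  // value for WhiskAppSecret
  };
}
```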

Running the application

With the iOS simulator

Start the application from Xcode with iPhone 6s as the target

Select an existing picture

Note: To add pictures to the simulator, go to the home screen (Cmd+Shift+H). Drag and drop images from the Finder to the simulator window. This will open the Photos app and you should see your images.

The picture is sent for analysis and results are returned:

Results are made of the faces detected in the picture and of tags returned by Watson. The tags with the highest confidence score are pre-highlighted. The highlighted tags will be used when sharing the picture. You can tap tags to toggle their state.

Press the Share button. This opens the standard iOS sharing screen.

Note: to configure a Twitter account, go to the Settings app on the simulator. Under Twitter, add your account (no need for the Twitter app to be installed). You can go back to the home screen with Cmd+Shift+H.

Pick Twitter as an example.

The picture and the highlighted tags are included in the message. The message can be edited before posting.
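The tag handling in the steps above (pre-highlighting the highest-confidence tags, toggling a tag on tap, and including only highlighted tags in the message) can be sketched as follows. This is an illustration in JavaScript, not the app's Swift code. The tag shape `{ name, score }`, the default of three highlighted tags, and the hashtag message format are all assumptions.

```javascript
// Assumed tag shape: { name: string, score: number }.
// Marks the `count` highest-scoring tags as highlighted (count is an assumed default).
function preHighlight(tags, count = 3) {
  const top = new Set(
    [...tags]
      .sort((a, b) => b.score - a.score)
      .slice(0, count)
      .map((tag) => tag.name)
  );
  // Preserve the original order; only add the `highlighted` flag.
  return tags.map((tag) => ({ ...tag, highlighted: top.has(tag.name) }));
}

// Flips the state of one tag, as tapping it would.
function toggle(tags, name) {
  return tags.map((tag) =>
    tag.name === name ? { ...tag, highlighted: !tag.highlighted } : tag
  );
}

// Builds the text to share from the highlighted tags only.
function shareMessage(tags) {
  return tags
    .filter((tag) => tag.highlighted)
    .map((tag) => '#' + tag.name)
    .join(' ');
}
```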

Code Structure

OpenWhisk

analysis.js holds the JavaScript code to perform the image analysis:

It retrieves the image data from the Cloudant document. The data has been attached by the iOS app as an attachment named "image.jpg".
It saves the image file locally.
If needed, it resizes the image so that it matches the requirements of the Watson service.
It calls Watson.
It returns the results of the analysis.

The action runs asynchronously.
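The steps above can be sketched as a promise chain. This is not the code in analysis.js: each step is injected as a function so the flow can run without Cloudant or Watson, and every helper name, as well as the `imageDocumentId` parameter, is hypothetical. Current OpenWhisk Node.js runtimes accept a Promise returned from `main` for asynchronous actions; the original action may use a different mechanism.

```javascript
// Sketch of the analysis flow. `steps` supplies one function per stage.
function analyze(args, steps) {
  return steps
    // 1. Retrieve the "image.jpg" attachment from the Cloudant document.
    .fetchAttachment(args.cloudantUrl, args.cloudantDbName, args.imageDocumentId, 'image.jpg')
    // 2. Save the image data to a local file.
    .then((data) => steps.saveLocally(data))
    // 3. Resize the file if the service requirements call for it.
    .then((file) => steps.resizeIfNeeded(file))
    // 4. Send the file to Watson.
    .then((file) => steps.callWatson(file, args.watsonApiKey))
    // 5. Return the results, or an error object if any step failed.
    .then((analysis) => ({ ok: true, analysis }))
    .catch((err) => ({ error: err.message }));
}

// Stand-in steps for trying the flow locally; they do no real I/O.
const stubSteps = {
  fetchAttachment: () => Promise.resolve(Buffer.from('fake image bytes')),
  saveLocally: () => Promise.resolve('/tmp/image.jpg'),
  resizeIfNeeded: (file) => Promise.resolve(file),
  callWatson: () => Promise.resolve({ tags: [], faces: [] }),
};
```

Calling `analyze(params, stubSteps)` resolves to `{ ok: true, analysis: { tags: [], faces: [] } }`. Replacing one stub with a function that rejects shows how a failure in any stage ends up in the `error` field.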

iOS

File	Description
ServerlessAPI.swift	Stores the image in Cloudant and executes the analysis OpenWhisk action, waiting for the result.
Result.swift	Encapsulates the JSON result
HomeController.swift	Manages the selection of an existing picture and taking a picture from the camera
ResultController.swift	Uses ServerlessAPI to send the image for processing and then displays the results of the analysis
FacesController.swift	Embedded in ResultController; it handles the face collection view
FaceCellRenderer.swift	Renders a face in the FacesController

Contribute

Please create a pull request with your desired changes.

Troubleshooting

OpenWhisk

Polling activations is a good starting point for debugging the OpenWhisk action execution. Run

wsk activation poll

and submit a picture for analysis.

A typical activation log when everything goes fine will look like:

Activation: vision-analysis (123fb4230902822202029fff436a94be745)
2016-02-23T16:17:53.955350233Z stdout: [ 49382920fdb022039403934b3bd33d00 ] Processing image.jpg from document
2016-02-23T16:17:59.847872226Z stdout: [ 49382920fdb022039403934b3bd33d00 ] OK
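To see how long an analysis took, the log lines can be parsed and their timestamps compared. The sketch below assumes the line layout shown in the sample above (timestamp, stream name, bracketed identifier, message). The function names are illustrative, and other actions may log in a different format.

```javascript
// Parses one log line of the form:
//   <timestamp> stdout: [ <identifier> ] <message>
// Returns null when the line does not match that layout.
function parseLogLine(line) {
  const match = /^(\S+)\s+(stdout|stderr):\s+\[\s*(\S+)\s*\]\s+(.*)$/.exec(line);
  if (!match) return null;
  // Date holds millisecond precision, so drop the extra fractional digits.
  const time = new Date(match[1].replace(/(\.\d{3})\d*Z$/, '$1Z'));
  return { time, stream: match[2], id: match[3], message: match[4] };
}

// Milliseconds between the first and last lines that could be parsed.
function elapsedMs(lines) {
  const parsed = lines.map(parseLogLine).filter(Boolean);
  if (parsed.length < 2) return 0;
  return parsed[parsed.length - 1].time - parsed[0].time;
}
```

Applied to the two log lines in the sample above, `elapsedMs` returns 5892, meaning just under six seconds between the start of processing and the final OK.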

iOS

The application prints several statements to the console as it uploads, analyzes and updates the user interface. Make sure you correctly updated the constants in ServerlessAPI.swift.

Credits

The application uses:

License

See License.txt for license information.
