Initial Designs
Table of Contents | ||||||
---|---|---|---|---|---|---|
|
Point of View:
Arjun is a 40 years old father living in Mumbai, India whose son Raj is a working professional recently moved to San Francisco.
...
When Raj is busy and unavailable to talk, Arjun wishes to recreate the stories that Raj has told him. He wants to learn more about San Francisco.
Goals
Overall goal:
Raj's parents and friends back home want to create the narratives of his stories.When communicating
Subgoals
- When Arjun and Raj communicate in real time, Arjun would like more contextual information about the locations and events that Raj mentions.
- Arjun wants to catch up on the content that Raj has posted about his life in San Francisco when Raj is not around and "replay" stories that Raj has told to him in the past.
- Arjun wants to be able to
...
- explore the places that Raj has visited in an intuitive way.
Design Sketches
Set 1 (Anant)
...
...
...
...
...
...
...
This is a different kind if interface that gives users more control. It is a direct manipulation interface that les user explore any location they want. It also allows simulate walking or driving. It allows changing speed, pause, and resume. It stretches to ultra-safe as user can asynchronously explore a location.
Set 2 (Alex)
|
|
|
---|---|---|
|
|
|
Text | Text | Text |
Set 3 (Katya)
|
|
|
---|---|---|
|
|
|
Direct manipulation keyboard computer interface. | Desktop interface based on entering information verbally. | iPad/iPhone speech interface. |
Final (Anant)
...
...
...
This interface is inspired from Skype Interface Left sidebar (3 main links) Contacts (to see all the contacts)
...
Final Storyboards
...
| A call session
|
| Arjun wants to explore the place after the call. He goes to Teleport and puts MIT Stata Center and clicks the button "Take Me here"
|
Learnability
- The interface (see the home page) uses the user mental model of chatting, calling and video calling
- Most popular tool used by people for international calling is Skype -- this user interface is consistent with Skype interface
- Auto-highlighting of locations is consistent with hovering affordance (people are likely to hover over highlighted text)
Efficiency
- For direct manipulation -- only keyboard inputs (navigation controls, camera controls and altitude) to avoid switching latency (switching time between keyboard/mouse)
- Avoids any effect of mouse sensitivity
- Allows random exploration
Safety
- For screen 3: if location recognizer fails to recognize a location or recognizes wrong location, user can change it.
- The interface is extremely safe because use can change location, camera angle, altitude, direction at any point of time
Speech and Touch (Katya)
|
|
---|---|
| STEP 1: Arjun talks to his son, Raj, on iPad or iPhone. Raj describes how amazing it was to drive to LA from SFO through Highway 1. Teleport automatically stores spatial descriptions mentioned by Raj. |
| STEP 2: After talking to Raj, Arjun scrolls through spatial descriptions stored and picks one. Avatar ask whether he wants to travel by foot or car and then how would he like to describe the route: 1) by start and end 2) by landmarks 3) by a reference object. |
| Step 3: Depending on the option there are three possible moves: 1) say start and end locations; wait while the animation is generated; watch the animation. 2) say landmark locations; wait while the animation is generated; watch the animation. 3) say the reference object; use 'move forwards','move backward', 'turn right' and 'turn left' commands to navigate around the object. |
| STEP 4: Interaction viewing of animation. While watching the animation by tilting the phone to the right or left Arjun can interactively change viewing angle of the animation. |
| STEP 5: By Tapping over the objects Arjun can see additional information, such as name of the object, the weather at the moment of the conversation or even additional images if added by his son. |
Learnability
- Interface extensively uses visual clues that make interaction extremely intuitive for elderly people who are often not accustomed to the use of conventional desktops. Visual clues take advantage of conventional metaphors and colour.
- The use of metaphors and colour scheme are both internally and externally consistent.
- Furthermore, speech based interaction reduces cognitive load by sequentially offering relevant options.
- The interface implements animated icons that symbolise receiving speech or avatar's speech, giving a clear visual representation of system's state.
- The problem with learnability might occur if the system fails to recognise the speech command because it can be extremely difficult to explain to the user what is wrong in the way they say things: users might be pronouncing words not sufficiently clear, there may be background noise, users can use out of vocabulary words.
- In order to convey what verbal input is expected, questions are designed as either 1) restatement of the phrase used by the avatar (e.g. "Choose one of the following options…" expects a direct restatement of one of the options), or 2) when possible questions prompt a one word response (e.g. "Name the first landmark" or "Name the start of the route").
Efficiency
- Speech based interaction alone may be inefficient for users who use the system very often. The fact that the system asks the same questions can become irritating after some point.
- Nevertheless, the targeted user group is elderly people for whom physical manipulation often may pose additional challenges. From this point of view the interface is considered efficient.
- The interface combines speech and touch which allows to delegate problems that would be difficult to solve with speech only to a touch based interaction. For instance, In order to get additional information about an object the user needs to tap over the object. Another example is that the system records a history of spatial descriptions that the user can scroll through because memorising all the location is not feasible.
- Difficult to convey which statements will work. To help the situation the system asks questions that prompt one word response, such as "Name the first landmark" or "Name the start of the route". The system also provides and animated graphic representation of the expected input.
- Generating an animated video of a route can be very time consuming.
Safety
- The weakness of the system is that there is infinite number of speech commands that users can try.
- The interface improves safety of the system by making it to ask questions sequentially and in case of noisy input ask additional questions that help to disambiguate input.
- Generating animated videos of the route can be time consuming, therefore, there is a risk that users can get impatient and start tapping the screen or giving additional commands.