However, users interact mostly with the user interface, not the business logic. If the UI is flawed, even with correct business logic, it’s like having a car with all components tested except the connection between the steering wheel and the wheels. At some point, a user is going to messily encounter this oversight while taking an offramp doing 75.
In most cases, this is solved by a comprehensive manual testing plan. However, the MRTK3 contains a lot of UI interaction tests, and a host of classes that make simulated user input possible. And the greatest thing is: with some futzing around, you can actually use those classes yourself to make your own simulated UI test:
For this demo, I have put together a simple set of requirements:
The menu itself you could see (very briefly) in the intro ‘movie’ at the top of this article.
I am assuming a project set up for MRTK3, with some functionality in it.
First, right-click on "Assets" in your project, then hit Create / Testing / Test Assembly Folder and give it a name. I called mine InteractionTests. Now, as I have written before, if you define assemblies yourself, Unity is not going to do you the pleasure of auto-referencing assemblies anymore, so you have to do all that yourself. Which ones you need depends on what you are testing. I have found out that for my particular tests, we need to add the following assemblies as references:
The first two are added by default. Assemblies MRTK.Input.RuntimeTests and MRTK.Core.TestUtilities contain the actual utility classes we need to write code to simulate input, and the other three contain classes that we need to check results - like if a button is toggled or not.
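For reference, a sketch of what the references in the generated InteractionTests.asmdef could look like - the two test runner entries and the two MRTK test assemblies are the ones named above; the remaining names are illustrative guesses, so check what your own tests actually require:
{
    "name": "InteractionTests",
    "references": [
        "UnityEngine.TestRunner",
        "UnityEditor.TestRunner",
        "MRTK.Input.RuntimeTests",
        "MRTK.Core.TestUtilities",
        "MRTK.Core",
        "MRTK.Input",
        "MRTK.UXCore"
    ]
}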
The next step is very weird. You see, if you now write code in your test class and use the TestHand class from the MixedReality.Toolkit.Input.Tests namespace, Unity cannot find it. While it very clearly is there:
It wasn’t until I stumbled on the 17th comment on this post from 2019 in the Unity forums that I found out what I needed to do: go to the Packages folder, manually edit the manifest.json file, and add the following at the end:
"testables" :
[
"org.mixedrealitytoolkit.input",
"org.mixedrealitytoolkit.core"
]
You can’t make this up.
As always, if you know what you are looking for, the testables entry is actually mentioned in Unity’s infamously confusing documentation, but not with this critical piece of knowledge. Anyway, the result is A) you can now finally use the MixedReality.Toolkit.Input.Tests utility classes, and B) all the unit tests in both assemblies now show up in your Test Runner, next to the ones you are going to add (in this picture, they already are).
The menu looks like this, and we need this information to be able to see how the menu responds to input actions.
For UX tests, we can utilize the BaseRuntimeHandInputTests class. This is a subclass of the MRTK BaseRuntimeInputTests, which takes care of a lot of things, like setting up a test scene with an MRTK XR Rig and destroying it after the test.
public class ButtonsTests : BaseRuntimeHandInputTests
{
private const string MenuGuid = "e9ddf3517c4b9c7488c12bdec6a9917f";
private GameObject testGameObject;
private List<PressableButton> allButtons;
At the top, you see the menu prefab guid, which you can find in the Menu.prefab.meta file:
fileFormatVersion: 2
guid: e9ddf3517c4b9c7488c12bdec6a9917f
PrefabImporter:
externalObjects: {}
userData:
assetBundleName:
assetBundleVariant:
As well as some other things we will need in the tests. Below that, the Init method creates the prefab and gathers some information about the prefab: its initial position and the buttons.
[SetUp]
public void Init()
{
testGameObject = InstantiatePrefab(MenuGuid);
allButtons = FindByName(testGameObject, "Buttons-GridLayout").
    GetComponentsInChildren<PressableButton>().ToList();
}
The Teardown just destroys the object. Note it does not need a [TearDown] attribute; the base class takes care of that.
public override IEnumerator TearDown()
{
yield return base.TearDown();
Object.Destroy(testGameObject);
}
The test that tests requirement 2 - only one button can be toggled - is the most complex. Or actually, it does the most.
[UnityTest]
public IEnumerator PressingTwoDifferentButtonsShouldOnlySelectTheLast()
{
var pressedButtons = new List<PressableButton>();
var initialHandPosition = GetInitialHandPosition();
TestHand hand = null;
yield return GetHand(initialHandPosition, h => { hand = h; });
We need a list with the buttons that are already pressed, to make sure we don’t press the same button twice. Then we get a first hand position, which does not really matter, but we need an initial position. And then we test the initial condition: no buttons pressed.
First, we test that there are no toggled buttons. Then we move the hand from button to button, poke each button, and every time there should be exactly one button toggled.
Assert.AreEqual(0, GetToggledButtonCount());
foreach(var button in allButtons)
{
var handPosition =
GetInitialHandPositionBefore(button.gameObject, HandInFrontOfGameObject);
yield return MoveHandTo(hand, handPosition);
yield return PokeHand(hand, HandInFrontOfGameObject);
Assert.AreEqual(1, GetToggledButtonCount());
AddButtonToPressedList(pressedButtons);
}
A lot of the helper methods that I created to make things easier are defined in the base class BaseRuntimeHandInputTests (which extends the MRTK3 class BaseRuntimeInputTests, as I mentioned before).
public abstract class BaseRuntimeHandInputTests : BaseRuntimeInputTests
{
protected const int HandMoveSteps = 1;
protected const int UpdateFrames = 1;
protected const float HandInFrontOfGameObject = 0.15f;
protected const float InitialHandInFrontOfUserDistance = 0.2f;
HandMoveSteps is used in the methods that actually move the hand; the lower the number, the fewer steps are taken in moving the hand - so the hand moves faster. UpdateFrames is the wait time after a hand move or creation. The same goes here: a lower number means a faster unit test. These numbers might be adapted to debug the test visually.
protected Vector3 GetInitialHandPosition(
float initialDistance = InitialHandInFrontOfUserDistance)
{
return InputTestUtilities.InFrontOfUser(Vector3.forward * initialDistance);
}
protected Vector3 GetInitialHandPositionBefore(
GameObject testGameObject,
float initialDistance = HandInFrontOfGameObject)
{
return testGameObject.transform.position - Vector3.forward * initialDistance;
}
There are basically two methods doing this: GetInitialHandPosition gets a position in front of the user, GetInitialHandPositionBefore gets a position in front of a game object, so you can simply move the hand forward and press a button, for instance.
A bit of an oddball method - it creates the hand at the initial position. Since an IEnumerator method can’t return a value and also can’t have ref or out parameters (I tried), I used a lambda to return the actual hand:
protected IEnumerator GetHand(Vector3 initialHandPosition, Action<TestHand> action)
{
var hand = new TestHand(Handedness.Right);
yield return hand.Show(initialHandPosition);
yield return RuntimeTestUtilities.WaitForUpdates(UpdateFrames);
action(hand);
}
Note the RuntimeTestUtilities.WaitForUpdates call. This needs to be done after every hand creation or move; otherwise, the test code will throw a “Cached unprocessed value unexpectedly became outdated for unknown reason, new value ‘0’ old value ‘3’” error.
With everything in place, now it’s very simple to move the hand.
protected IEnumerator PokeHand(TestHand hand, float distance)
{
yield return MoveHand(hand, Vector3.forward * distance);
yield return MoveHand(hand, -Vector3.forward * distance);
}
protected IEnumerator MoveHand(TestHand hand, Vector3 distance)
{
yield return hand.Move(distance, HandMoveSteps);
yield return RuntimeTestUtilities.WaitForUpdates(UpdateFrames);
}
protected IEnumerator MoveHandTo(TestHand hand, Vector3 location)
{
yield return hand.MoveTo(location, HandMoveSteps);
yield return RuntimeTestUtilities.WaitForUpdates(UpdateFrames);
}
PokeHand moves the hand forward and backward, MoveHand moves the hand over a specific vector (so relative to the current position), and MoveHandTo moves the hand to a specific absolute location. These methods only add a RuntimeTestUtilities.WaitForUpdates call to TestHand’s own move methods, but it’s a bit annoying to have to add that yourself after every call.
The test method itself actually checks every time that only one button is toggled, but that check would still pass if we just pressed the same two buttons back and forth. To make sure every button press is a different press, I wrote this little routine:
private void AddButtonToPressedList(List<PressableButton> pressedButtons)
{
    var button = allButtons.FirstOrDefault(b => b.IsToggled);
if (!pressedButtons.Contains(button))
{
pressedButtons.Add(button);
}
else
{
Assert.Fail("Button already pressed");
}
}
This I basically stole from existing code in the MRTK3 itself, with a little adaptation. This is how you load a prefab from a guid, then instantiate it.
private GameObject InstantiatePrefab(string guid)
{
var prefabPath = AssetDatabase.GUIDToAssetPath(guid);
var prefab = AssetDatabase.LoadAssetAtPath(prefabPath, typeof(Object));
return Object.Instantiate(prefab) as GameObject;
}
This is a rather standard routine that recursively looks for a game object by name, below a starting object.
protected GameObject FindByName(GameObject parent, string name)
{
if (parent.name == name)
{
return parent;
}
foreach (Transform child in parent.transform)
{
var result = FindByName(child.gameObject, name);
if (result != null)
{
return result;
}
}
return null;
}
Yes, I know this is a trivial case. Yes, I know PressableButton has methods that can simulate clicks, so you don’t need to go this roundabout way. Yes, I know the only-one-button-toggled logic should be driven by business logic that could be checked. Yes, I also know this is technically integration testing, not unit testing. That is not the point of this blog post: the point is to show how to set up and execute these kinds of automated UI tests using stuff that is already in the MRTK3. You can do all kinds of nifty things with hands, and this is very useful for finding events that are wired up in the editor but were broken later. The code in this blog post can be a useful starting point.
(Almost) everything in the ServiceFramework starts with a profile, and so does this service.
public class UserPresenceServiceProfile : BaseServiceProfile<IServiceModule>
{
[SerializeField]
private InputActionReference gazeTrackingState;
[SerializeField]
private float userAwayWaitTime = 3.0f;
[SerializeField]
private float userPresentWaitTime = 0.5f;
public InputActionReference GazeTrackingState => gazeTrackingState;
public float UserAwayWaitTime => userAwayWaitTime;
public float UserPresentWaitTime => userPresentWaitTime;
}
userAwayWaitTime is the time (in seconds) the service needs to detect the continued user absence before it signals the outside world. This is to prevent events being triggered when users so much as blink, or the eye tracking loses track for a few moments.
userPresentWaitTime is the time (also in seconds) the service needs to detect the continued return of the user after absence.
gazeTrackingState is a reference to the Tracking State input action of the MRTK Default Input Actions. This is used to get the actual gaze state from the Interaction Manager in Unity’s XR Interaction Toolkit.
The service exposes only two items: the current user presence, and an event to tell the outside world the presence has changed.
public interface IUserPresenceService : IService
{
public bool IsUserPresent { get; }
public UnityEvent<bool> UserPresenceChanged { get; }
}
I will omit declarations and most of the constructor, as most of it is fairly standard for a Service Framework Service. The only thing worth noting is this line, where we pick up the reference to the gaze tracking state from the profile:
gazeTrackingState = profile.GazeTrackingState;
When the service is actually enabled, it only sets up a listener to the gaze tracking state’s action.performed event. Note, this is actually not MRTK specific anymore - gazeTrackingState is an InputActionReference, and that’s part of Unity’s Input System.
public override void Enable()
{
if (isInitialized)
{
return;
}
isInitialized = true;
gazeTrackingState.action.performed += GazeTrackingStateChanged;
}
private void GazeTrackingStateChanged(InputAction.CallbackContext ctx)
{
gazeStateResult = ctx.ReadValue<int>();
}
Now the thing to keep in mind is: if you run this in the editor, GazeTrackingStateChanged gets called a few times and that’s it. When you run this on HoloLens 2, GazeTrackingStateChanged gets really hammered with events. It seems like the eye tracker is firing this event all the time. That’s why the only thing we do is put the value in a field, and let the service’s Update loop handle the logic.
In the Update loop, we first check the state value. I have seen that “3” means “eye tracking detected”. I also have seen value “0” when I took off the device, so I have chosen to interpret “3” as “user present” and anything else as “not present”.
public override void Update()
{
var newState = gazeStateResult == 3;
First step: if the detected new state is the same as the current state, remember that last requested state, and exit the method:
if (newState == IsUserPresent)
{
lastRequestedState = newState;
return;
}
Second step: if there is, however, a new state, the method runs on to the second if. If the lastRequestedState does not match the newState yet, a state change has taken place. This is where we note the time the state change took place:
if( newState != lastRequestedState)
{
lastRequestedState = newState;
lastStateChangeTime = Time.time;
}
So: the first if doesn’t do anything anymore, as newState and IsUserPresent are not equal. But newState and lastRequestedState are equal, so lastStateChangeTime stays fixed. Now the clock starts ticking in the last part:
if( Time.time - lastStateChangeTime > (lastRequestedState ?
profile.UserPresentWaitTime : profile.UserAwayWaitTime))
{
IsUserPresent = lastRequestedState;
UserPresenceChanged.Invoke(IsUserPresent);
}
}
If the user does not do anything that makes the state flip again (at which point the first if kicks in again and ‘stops the clock’), IsUserPresent is set and the event is called. The time to wait before the event indicating the state change is determined by whether the presence changes from true to false, or the other way around. It’s not really rocket science.
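For reference, here is the complete Update method, assembled from the fragments above:
public override void Update()
{
    var newState = gazeStateResult == 3;

    // State unchanged: keep lastRequestedState in sync and 'stop the clock'
    if (newState == IsUserPresent)
    {
        lastRequestedState = newState;
        return;
    }

    // A fresh state change: note when it happened
    if (newState != lastRequestedState)
    {
        lastRequestedState = newState;
        lastStateChangeTime = Time.time;
    }

    // The change has persisted long enough: commit it and notify listeners
    if (Time.time - lastStateChangeTime > (lastRequestedState ?
        profile.UserPresentWaitTime : profile.UserAwayWaitTime))
    {
        IsUserPresent = lastRequestedState;
        UserPresenceChanged.Invoke(IsUserPresent);
    }
}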
I have added a demo scene with a simple behaviour UserPresenceDisplayer that shows how you might use this. The interesting parts are these:
public class UserPresenceDisplayer : MonoBehaviour
{
private async Task Start()
{
audioSource = GetComponent<AudioSource>();
await ServiceManager.WaitUntilInitializedAsync();
userPresenceService =
ServiceManager.Instance.GetService<IUserPresenceService>();
userPresenceService.UserPresenceChanged.AddListener(OnUserPresenceChanged);
}
private void OnUserPresenceChanged(bool currentPresence)
{
displayText.text = $"User is {(currentPresence ? "present" : "away")}";
audioSource.PlayOneShot(currentPresence ? userPresentClip : userAwayClip);
}
}
It waits for the Service Manager to be ready, then gets a reference to the service. When events are received, it shows an appropriate message on a floating text, plays a high note when the user presence changes from away to present, and a low note when it changes from present to away.
I have found it is best to set a slightly longer wait time (3-5 seconds) on the “user away” event before pausing whatever you want to pause, as false positives can be really annoying to the user. If the user returns, however, you want your app’s functionality back up to speed ASAP, so that is usually a shorter time. I can also imagine scenarios where a quick “user away” event is necessary, for instance when an app is used to monitor an exam or something. You can simply make multiple profiles for that without needing to change any code. That’s the beauty of the Service Framework.
Note: so far, this has been tested on HoloLens 2 only. For that, it requires the Eye Gaze Interaction Profile to be set in the OpenXR interaction profiles.
And it will, obviously, only work with devices that actually support eye tracking. I will conduct experiments with Magic Leap 2 soon.
This, of course, would not do. And, also of course, stubborn as I am, I banged against it until it worked again. Only this needs a little more code. But it’s also now a bit more beautiful, as the buttons actually are now animated:
I hope y’all can appreciate the unplanned cameo of a Starling on the bird feeder outside the window :).
If you are not interested in the why and the how:
NonNativeKeyboard.Instance.Open();
And you have a keyboard. The TouchableNonNativeKeyboard has been scaled to what I think is a usable size, has an MRTK3 RadialView to keep it in view, and my helper behaviour AppearInCenterViewController to make it appear in the center of your view. It works the same as the normal NonNativeKeyboard. In fact, it is the normal NonNativeKeyboard, just with some added stuff.
The basic NonNativeKeyboardTouchAdapter is very simple: it changes some settings on the audio component, because I think this works better for typing. MRTK3 buttons have no spatial sounds, so why should these have them? Then it simply loops over all the Button child objects - even the inactive ones - and adds a NonNativeKeyTouchAdapter.
public class NonNativeKeyboardTouchAdapter : MonoBehaviour
{
private void Awake()
{
var defaultAudioComponent = GetComponent<AudioSource>();
defaultAudioComponent.playOnAwake = false;
defaultAudioComponent.spatialize = false;
}
private void Start()
{
var buttons = GetComponentsInChildren<Button>(true);
foreach (var button in buttons)
{
// The search box has an incorrect collider and should not act as a
// button anyway
if (button.gameObject.name != "search")
{
button.gameObject.EnsureComponent<NonNativeKeyTouchAdapter>();
}
}
}
}
The NonNativeKeyTouchAdapter actually does all the work - there is one for every key. Except for “search”: I think that one got a Button behaviour by accident, as its OnClick method goes nowhere.
At Awake, it calculates a few things related to the button’s animation - where it should start, and where it should end.
private void Awake()
{
defaultPosition = transform.localPosition;
animatedPosition = defaultPosition + new Vector3(0, 0, AnimationMovementDelta);
}
In OnEnable, we make sure the button is always at the default position, because it might have been halfway through its animation when the key disappeared. The keyboard consists of several panes, and another one appears if you press the “ABC” button or the “&123” button. For the same reason, we must reset the lastClickTime (we don’t want a button to be pressable again too quickly, otherwise it will repeat rapidly), and the location of the button collider should also be reset. Why this is, I will explain later.
private void OnEnable()
{
transform.localPosition = defaultPosition;
lastClickTime = Time.time;
if (isInitialized)
{
buttonCollider.center = buttonColliderDefaultCenter;
}
Initialize();
}
Initialization should only be done once - and why I used OnEnable to kick it off instead of Awake or Start I will explain later as well.
First order of business is, of course, to check whether we did not already initialize in an earlier round of OnEnable. Then we make a collider around the button that is a bit smaller than the actual size, to make it less likely the user hits two keys at once. The collider is also moved off-center, a bit ‘backwards’, so you won’t easily press ‘through’ the button by accident.
private void Initialize()
{
if (isInitialized)
{
return;
}
isInitialized = true;
var rectTransform = GetComponent<RectTransform>();
buttonCollider = gameObject.EnsureComponent<BoxCollider>();
var size = new Vector3(
rectTransform.rect.size.x - ColliderMargin,
rectTransform.rect.size.y - ColliderMargin,
ColliderThickness);
buttonCollider.size = size;
buttonColliderDefaultCenter = new Vector3((size.x + ColliderMargin) / 2.0f,
(-size.y - ColliderMargin) / 2.0f, ColliderZDelta);
buttonCollider.center = buttonColliderDefaultCenter;
The resulting collider, when made visible, looks like this:
Before we do that, we first grab some stuff we will need:
image = GetComponent<Graphic>();
var defaultColor = image.color;
var button = GetComponent<Button>();
Then we add a StatefulInteractable and set up the first event. Using my own blog about the available events, I took the firstSelectEntered event, which is fired when something enters the collider. When this event is launched by a PokeInteractor (i.e. your index finger) and the key has not been clicked in the last ReClickDelayTime seconds (1 by default), it fires the normal button’s event, as if it had been clicked ‘the old way’. It also starts a coroutine to animate the button to its pressed position:
interactable = gameObject.EnsureComponent<StatefulInteractable>();
interactable.firstSelectEntered.AddListener(selectArgs =>
{
if (selectArgs.interactorObject is not PokeInteractor ||
Time.time - lastClickTime < ReClickDelayTime)
{
return;
}
button.onClick.Invoke();
StartCoroutine(MoveButton(defaultPosition, animatedPosition));
});
Its mirror image is of course: when the user stops pressing the button, it moves the button back to its ‘unpressed’ position:
interactable.lastSelectExited.AddListener(_ =>
{
StartCoroutine(MoveButton(animatedPosition, defaultPosition));
});
And then there’s this little line:
button.interactable = false;
We don’t want the Button behaviour to interfere with us when using touch. We literally only hijack its OnClick event and then turn it off. Otherwise, hover events go off and hand ray interaction is still possible, giving a pretty confusing experience.
Unfortunately, turning off the Button behaviour disables all hover events, which is not nice, so we add that back using the Button’s highlightedColor property:
interactable.firstHoverEntered.AddListener(hoverArgs =>
{
SetColorOnHoverPoke(hoverArgs.interactorObject,
button.colors.highlightedColor);
});
interactable.lastHoverExited.AddListener(hoverArgs =>
{
SetColorOnHoverPoke(hoverArgs.interactorObject, defaultColor);
});
}
private void SetColorOnHoverPoke(IXRHoverInteractor interaction, Color color)
{
if (interaction is PokeInteractor)
{
image.color = color;
    }
}
In the standard non-native keyboard, there is no animation at all, and neither did my MRTK2 behaviour add that. In the meantime, I am some four years further along in my experience with and ideas about UX, so I thought it would be cool to make the keys actually move. And they do, using this simple routine built around a Vector3.Lerp.
private IEnumerator MoveButton(Vector3 startPos, Vector3 endPos)
{
if (transform.localPosition == endPos)
{
yield break;
}
const float rate = 1.0f / AnimationTime;
var i = 0.0f;
while (i < 1.0f)
{
i += Time.deltaTime * rate;
var newPos = Vector3.Lerp(startPos, endPos, Mathf.SmoothStep(0f, 1f, i));
transform.localPosition = newPos;
buttonCollider.center =
buttonColliderDefaultCenter - (newPos - defaultPosition);
yield return null;
}
}
I actually had to look up how to use that Lerp in one of my very first HoloLens blogs from July 2016, because these days I always use LeanTween - but I did not want to create a dependency on that now. Also: there is something funky about this routine, as you might have noticed - it does something with the button’s position all right, but also something with the collider’s center position. This is because I quickly found out that when you move a button backwards, the collider is dragged along. And if you then just touch the button, you get this stupid effect:
So that’s why, when the button moves backward, the collider moves forward in exactly the opposite direction - with the net result that the collider stays in the exact same place while the visible graphics do not, and the ping-pong effect shown above does not occur.
Now, to make sure button and collider positions don’t get messed up by buttons disappearing and re-appearing, we keep a reference to the initial position of both the collider and the button itself, and that is why they are always reset to their starting position at the start of OnEnable.
For that, you actually have to do the spelunking in the MRTK3 I talked about. You see, every button also has a NonNativeValueKey behaviour. This behaviour is a child class of NonNativeKey, and that does something funky in its Awake method:
protected virtual void Awake()
{
if (Interactable == null)
{
Interactable = GetComponent<StatefulInteractable>();
}
// If there is a StatefulInteractable, that is used to trigger the FireKey
// event. Otherwise the Button is used.
if (Interactable != null)
{
Interactable.OnClicked.AddListener(FireKey);
}
else
{
if (KeyButton == null)
{
KeyButton = GetComponent<Button>();
}
if (KeyButton != null)
{
KeyButton.onClick.AddListener(FireKey);
}
}
}
It checks if there’s a StatefulInteractable around, and if so, it wires the click event not to the button, but to the interactable. And we want it to work like it did, so we can hijack the Button’s OnClick and not have it mess with the StatefulInteractable. This is not a problem for the first keyboard_Alpha panel, as that is active by default, so the Awake events for those buttons have already run before we add the StatefulInteractable. But for the panels that are inactive by default, there suddenly is a StatefulInteractable when they awake. As a workaround, the initialization of my NonNativeKeyTouchAdapter is done in OnEnable, so whatever happens - whenever the buttons’ NonNativeValueKey behaviours awake, they will never find a StatefulInteractable, and they work as we want them to work.
There is actually a bug in the keyboard: the sounds only play for the keyboard_Alpha panel. This is because the MRTK3 KeyboardAudio behaviour only looks for active buttons and, as I said before, the other panels are inactive. So on TouchableNonNativeKeyboard I have disabled KeyboardAudio and added the behaviour FixedKeyboardAudio. It’s almost identical, but it looks for all buttons (using GetComponentsInChildren<Button>(true) instead of just GetComponentsInChildren<Button>()).
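To give an idea of what that boils down to, here is a minimal sketch of such a behaviour - the clickSound field and the onClick wiring are my assumptions for illustration, not the actual MRTK3 code:
using UnityEngine;
using UnityEngine.UI;

// Hypothetical sketch: hook a click sound to every Button under the keyboard,
// including those on panels that start out inactive
public class FixedKeyboardAudioSketch : MonoBehaviour
{
    [SerializeField] private AudioSource clickSound;

    private void Start()
    {
        // Passing 'true' also returns components on inactive children -
        // the essential difference with the stock KeyboardAudio behaviour
        foreach (var button in GetComponentsInChildren<Button>(true))
        {
            button.onClick.AddListener(() => clickSound.Play());
        }
    }
}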
So why would you use this? Well, I think the days of only building single-device apps are gone. These days, you try to target as many devices as possible. The Non Native Keyboard gives a consistent look & feel for text input, and is also easy to control as to where and how it appears - which native keyboards do not always allow. In my approach, I have tried to mess with the original NonNativeKeyboard as little as possible - the TouchableNonNativeKeyboard prefab is a simple Prefab Variant of NonNativeKeyboard, with literally only a bit of scaling changed, as well as three added behaviours: NonNativeKeyboardTouchAdapter, and the already mentioned RadialView and AppearInCenterViewController.
I hope you will find it useful. The demo project, once again, is at GitHub.
The idea is that you can use this to have multiple ‘experiences’ connected to multiple locations, for instance, if you want people to learn about a particular machine, building, or any other thing you can stick a QR code on. The QR code makes it possible to tie that experience to a particular location and align it potentially to the QR code’s orientation. You can see the airplane and the capsule taking both position and orientation of the QR code, while the ‘custom experience’ only takes the position and places the ‘experience’ above the QR code, using world orientation.
I am going to give a short recap of how things work, referencing my original articles from early 2021 where possible.
If you want to know in detail how it works: it is basically the same as I described here.
There are a few changes:
One of them is a new AutoEnable property, which makes the service start scanning for QR codes as soon as it starts. In the demo project, you will see the service now uses an “AutoStartQRCodeTrackingServiceProfile” with AutoEnable set to true. This automatically starts the service, as you can see in the piece of code below. Basically, only the last three lines are added.
This automatically starts the service. As you can see in the piece of code below. Basically, only the last three lines are added.
public override void Update()
{
if (qrTracker == null && accessStatus ==
QRCodeWatcherAccessStatus.Allowed)
{
SetupTracking();
}
}
private void SetupTracking()
{
qrTracker = new QRCodeWatcher();
qrTracker.Updated += QRCodeWatcher_Updated;
IsInitialized = true;
Initialized?.Invoke(this, EventArgs.Empty);
SendProgressMessage("QR tracker initialized");
if( profile.AutoEnable)
{
Enable();
}
}
The scene setup is simple. For every QR code, there’s a Tracker and a TrackerDisplayer.
Its main behaviour is called “ContinuousQRTrackerController”, and I chose the word “continuous” because, unlike in my previous samples, it does not stop the service once it has found a QR code, and you can reset it individually. You can see it looks for a QR code with payload “HLItem1”. You can, of course, change that to any value you like, as long as it matches the payload of the QR code you want to have tracked. Basically, the tracker does the following:
- listens to the QRCodeFound event
- checks whether the QR code’s payload matches the locationQrValue serialized field
- positions and aligns the marker on the QR code

Although the last part is actually done by the Spatial Graph Coordinate Setter behaviour.
Some highlights of ContinuousQRTrackerController’s code. At startup, it turns off the marker (more on that later), gets a reference to the QR tracking service, and waits a rather arbitrary 0.25 seconds to give the service time to start up. Then it sets up some events:
private IQRCodeTrackingService QrCodeTrackingService =>
qrCodeTrackingService ??= ServiceManager.Instance.GetService<IQRCodeTrackingService>();
private async Task Start()
{
markerHolder = spatialGraphCoordinateSystemSetter.gameObject.transform;
markerDisplay = markerHolder.GetChild(0).gameObject;
markerDisplay.SetActive(false);
ResetTracking(false);
// Give service time to start;
await Task.Delay(250);
if (!QrCodeTrackingService.IsSupported)
{
return;
}
QrCodeTrackingService.QRCodeFound += ProcessTrackingFound;
spatialGraphCoordinateSystemSetter.PositionAcquired += SetPosition;
}
When a QR code is found, we first check if the message has data at all, if the marker is already displayed, or if it has just been reset by the user - in which case we don’t do anything at all. We also check whether the QR code was actually seen in the last 200 ms, and then and only then do we ask the spatialGraphCoordinateSystemSetter to actually align the marker to the QR code.
private void ProcessTrackingFound(object sender, QRInfo msg)
{
if (msg == null || markerDisplay.activeSelf || resetTime > Time.time)
{
return;
}
lastMessage = msg;
if (msg.Data == locationQrValue &&
Math.Abs((DateTimeOffset.UtcNow -
msg.LastDetectedTime.UtcDateTime).TotalMilliseconds) < 200)
{
spatialGraphCoordinateSystemSetter.SetLocationIdSize(msg.SpatialGraphNodeId,
msg.PhysicalSideLength);
}
}
The marker is the blueish thing you see appear over the QR code:
ResetTracking is called from the little menus floating over the ‘experiences’, and they allow you to make the particular QR code trackable again. It gives you a two-second grace period to get away from the QR code - this makes sense, because otherwise the QR code is immediately tracked again and immediately shows again.
public override void ResetTracking()
{
ResetTracking(true);
}
private void ResetTracking(bool delayed)
{
if (delayed)
{
resetTime = Time.time + 2;
}
markerDisplay.SetActive(false);
}
Basically, this is nearly 100% equal to what I described earlier in this article about upgrading the whole shebang to OpenXR. It sits on the game object below the continuous tracker.
This is a behaviour that ties an object to a tracked QR code.
As you can see, the QRPoseTrackController for the Jet tracks the ContinuousTracker1. It starts as follows:
public class QRPoseTrackController : MonoBehaviour
{
[SerializeField]
private BaseTrackerController trackerController;
[SerializeField]
private bool setRotation = true;
private AudioSource audioSource;
private Transform childObj;
private void Start()
{
audioSource = GetComponentInChildren<AudioSource>(true);
childObj = transform.GetChild(0);
childObj.gameObject.SetActive(false);
trackerController.PositionSet.AddListener(PoseFound);
}
Note it actually refers to a BaseTrackerController rather than a ContinuousQRTrackerController - this allows for building other controller logic. It also has an option to not only set location but also rotation. For the airplane and the capsule, this is set to true; for the ‘custom experience’, to false. On startup, it gets the child object, tries to find an optional AudioSource, and adds a listener to the TrackerController’s PositionSet event.
When a position is found, it shows the object at the QR code’s location, optionally aligns it, and plays a sound. The Task.Yield thing is necessary because the AudioSource is on a game object that is initially disabled (in the Start method it says childObj.gameObject.SetActive(false), right?) and apparently Unity needs a frame to actually activate an AudioSource before it can play the sound.
private void PoseFound(Pose pose)
{
if (setRotation)
{
childObj.SetPositionAndRotation(pose.position, pose.rotation);
}
else
{
childObj.position = pose.position;
}
childObj.gameObject.SetActive(true);
Task.Run(PlaySound);
}
private async Task PlaySound()
{
await Task.Yield();
if(audioSource != null && audioSource.clip != null)
{
audioSource.Play();
}
}
The only thing left is this little method:
public void Reset()
{
trackerController.ResetTracking();
childObj.gameObject.SetActive(false);
}
This is the method called by the floating reset menu each ‘experience’ has; it basically hides the whole experience and resets the controller (giving you the 2-second grace time).
QR codes are a powerful and simple way to have objects appear at particular locations without having to set up all kinds of holograms in advance. This way, you can quickly set up an ‘experience’, a training scenario, or a kind of guided tour. A bit like a poor man’s Microsoft Dynamics 365 Guides ;). There are a few things to consider when using this code, though:
Have fun playing with it. The demo project is in this branch of the QRCodeService repo. This concludes my blogging for 2023, I wish you a happy 2024 both in Mixed and real Reality :)
Anyway, after showing how to get the position of the hand while doing an air tap, I thought I was done on this subject. Nope: two different developers wanted to know if I could tell them how to get what the hand ray was projecting on.
Well, I don’t know if I have found the right way, or even the best way, but I at least have found a way. I modified my previous sample a bit (again), so now it not only shows for each hand where the hand itself is during a tap, but also where the end of the hand ray is. This is actually a pretty simple adaptation of the previous code. The start is more or less the same:
private void Start()
{
handsAggregatorSubsystem =
XRSubsystemHelpers.GetFirstRunningSubsystem<IHandsAggregatorSubsystem>();
leftHand.SetActive(false);
rightHand.SetActive(false);
findingService = ServiceManager.Instance.
GetService<IMRTK3ConfigurationFindingService>();
but then comes the interesting part. See, my MRTK3ConfigurationFindingService does not only provide events to check if the left or right hand is triggering in some way, but also direct access to the hands themselves. And the hands have a LineRenderer component in their children:
var rightLineRenderer = findingService.RightHand.
gameObject.GetComponentInChildren<LineRenderer>(true);
var leftLineRenderer = findingService.LeftHand.
gameObject.GetComponentInChildren<LineRenderer>(true);
which happens to be the hand ray. And if you want to know where the end of the ray is: simply ask it the position of its last point like this:
var rayPos = leftLineRenderer.
GetPosition(leftLineRenderer.positionCount -1);
The whole thing that is triggered when you do an air tap with your left hand:
findingService.LeftHandStatusTriggered.AddListener(t=>
{
leftHand.SetActive(t);
if (t)
{
var rayPos = leftLineRenderer.
GetPosition(leftLineRenderer.positionCount -1);
textMesh.text = $"Left hand position: {GetPinchPosition(findingService.LeftHand)}";
textMesh.text +=
$"{Environment.NewLine} Left hand ray position: {rayPos}";
leftHand.transform.position = rayPos;
}
});
The code for the right hand is omitted, as it’s nearly identical. On a HoloLens 2, it looks like this:
To make sure the ray also hits physical objects (i.e., the spatial map), I have added an ARMeshManager to the project’s camera, as I described here. The caveat is: this hits everything with a collider - not only the spatial map, but also the cube floating in the air. If you want to distinguish between those, you will have to do raycasts along the direction of the LineRenderer yourself.
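A minimal sketch of what such a raycast could look like - the “SpatialMesh” layer name and the 10-meter range are assumptions; use whatever layer your ARMeshManager meshes actually end up on:
// Cast along the hand ray's own direction and only accept spatial map hits
var start = leftLineRenderer.GetPosition(0);
var end = leftLineRenderer.GetPosition(leftLineRenderer.positionCount - 1);
var direction = (end - start).normalized;
var spatialMask = LayerMask.GetMask("SpatialMesh"); // assumed layer name
if (Physics.Raycast(start, direction, out var hit, 10f, spatialMask))
{
    Debug.Log($"Hand ray hits the spatial map at {hit.point}");
}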
Demo project can be downloaded from this MRTKAirTap project branch.
Il2CppOutputProject\Source\il2cppOutput\Symbols\il2cppFileRoot.txt does not exist
Il2CppOutputProject\Source\il2cppOutput\Symbols\LineNumberMappings.json does not exist
There is very little to find online; about the only real source of information I found was this one, from June 2023, in the Unity forums. Basically, the solution is: edit the “Unity Data.vcxproj” file and remove the entries describing those files. This, of course, does not work when you make a fresh build or run into this in a CI/CD pipeline, which was exactly how I learned about this when I upgraded Augmedit’s Lumi to a new major Unity version.
The short version: if you are not interested in the how and why and want this issue fixed and you want it fixed now, just download this file, put it somewhere in your project, build the project again and be done with it forever.
So the problem is: Unity generates links in the Unity Data.vcxitems to files that are simply not there. And as the solution suggested in the Unity forums shows, we can do without them. So after my colleague Niek brought the Unity PostProcessBuild attribute to my attention, off I went:
[PostProcessBuild(1)]
public static void FixVcxItemsFile(BuildTarget target, string pathToBuiltProject)
{
if (target == BuildTarget.WSAPlayer)
{
var vcxItemsFile =
Directory.GetFileSystemEntries(pathToBuiltProject,
"Unity Data.vcxitems",
SearchOption.AllDirectories).FirstOrDefault();
if (vcxItemsFile != null)
{
FixVcxItemsFile(vcxItemsFile);
}
}
}
When the build is done, Unity calls methods in static classes decorated with PostProcessBuild. The number indicates the order in which they are to be called, should you have more than one, but since we don’t, that number is inconsequential. The method gets a build target and the path where the project is built to as parameters. We then search for the file “Unity Data.vcxitems” and if we find it, we are going to fix it.
The offending lines are pretty long and look like this, at least in Lumi:
<None Include="$(MSBuildThisFileDirectory)..\Il2CppOutputProject\Source\il2cppOutput\Symbols\il2cppFileRoot.txt">
<DeploymentContent>true</DeploymentContent>
<ExcludeFromResourceIndex>true</ExcludeFromResourceIndex>
</None>
<None Include="$(MSBuildThisFileDirectory)..\Il2CppOutputProject\Source\il2cppOutput\Symbols\LineNumberMappings.json">
<DeploymentContent>true</DeploymentContent>
<ExcludeFromResourceIndex>true</ExcludeFromResourceIndex>
</None>
And they sit inside an “ItemGroup” element. So the trick is to load the document into an XML processing API and find all “None” elements inside that ItemGroup that have an Include attribute ending with either “il2cppFileRoot.txt” or “LineNumberMappings.json”. I asked Copilot to write me an algorithm, which compiled great and even ran beautifully, but unfortunately didn’t do anything - and did that very inefficiently, too. But at least it showed me which API to use in this context, and I came to the following code:
private static void FixVcxItemsFile(string vcxItemsFile)
{
var projectFile = XDocument.Load(vcxItemsFile);
var itemsToDelete = projectFile.Descendants().
FirstOrDefault(node => node.Name.LocalName == "ItemGroup")?.
Descendants().Where
(node => node.Name.LocalName == "None" &&
(node.IncludeAttributeEndsWith("il2cppFileRoot.txt") ||
node.IncludeAttributeEndsWith("LineNumberMappings.json")));
if (itemsToDelete != null)
{
foreach (var item in itemsToDelete.ToList())
{
item.Remove();
}
projectFile.Save(vcxItemsFile);
}
}
So it does exactly what I just wrote: it finds the offending items, tells them to delete themselves, and saves the result.
IncludeAttributeEndsWith is a simple extension method that I wrote because the Linq statement is already complex enough:
private static bool IncludeAttributeEndsWith(this XElement element, string contents)
{
var attr = element.Attribute("Include");
if (attr == null) return false;
return attr.Value.EndsWith(contents);
}
And that’s all. Should you ever run into this strange error, you can get around it by using this little helper.
For contrast, this is the HoloLens 2 profile. Here the OpenXR Hands API is selected instead of the Magic Leap one.
Magic Leap 2 also supports keyword recognition - there is a custom API for that in its SDK. It’s things like this that make cross-platform development difficult, and it is why I could not get the speech commands in my port of Augmedit’s Lumi product to work. If only someone had thought of making a KeywordRecognitionSubsystem implementation, so you could use keyword recognition just as easily as on HoloLens 2…
Well, good news. Someone just did. Yours truly.
I will not even start to pretend I completely understand how subsystems work, but I have kind of an idea now. In theory, they can consist of 5 classes and an interface: the subsystem itself, a provider, a descriptor, a ‘Cinfo’ struct, and a configuration, plus an interface describing its capabilities.
For KeywordRecognitionSubsystems, there’s already a lot of heavy lifting done. We actually only need to provide a subsystem and a provider and can do so by extending existing base classes for keyword recognition. The entire subsystem therefore looks like this:
[Preserve]
[MRTKSubsystem(
Name = "MRTKExtensions.MagicLeap.SpeechRecognition",
DisplayName = "MRTK MagicLeap KeywordRecognition Subsystem",
Author = "LocalJoost",
ProviderType = typeof(MagicLeapKeywordRecognitionProvider),
SubsystemTypeOverride = typeof(MagicLeapKeywordRecognitionSubsystem))]
public class MagicLeapKeywordRecognitionSubsystem :
KeywordRecognitionSubsystem
{
#if MAGICLEAP
[RuntimeInitializeOnLoadMethod(
RuntimeInitializeLoadType.SubsystemRegistration)]
static void Register()
{
var cinfo = XRSubsystemHelpers.
ConstructCinfo<MagicLeapKeywordRecognitionSubsystem,
KeywordRecognitionSubsystemCinfo>();
if (!Register(cinfo))
{
Debug.LogError($"Failed to register the {cinfo.Name} subsystem.");
}
}
#endif
}
The MRTKSubsystem attribute describes the subsystem and indicates which Subsystem and Provider are to be connected together. Optionally, you can also configure a ConfigType in this attribute. The subsystem itself then only needs a static method decorated with a RuntimeInitializeOnLoadMethod attribute to actually get launched on startup. It uses the KeywordRecognitionSubsystemCinfo cinfo class - which is part of the MRTK3, so we don’t have to create this ourselves. As I said before, I don’t know why this is necessary, but sometimes you simply have to do some cargo cult programming and follow existing patterns.
The provider does the actual work. It derives from the abstract class KeywordRecognitionSubsystem.Provider, which requires us to implement the following methods:
UnityEvent CreateOrGetEventForKeyword(string keyword);
void RemoveKeyword(string keyword);
void RemoveAllKeywords();
IReadOnlyDictionary<string, UnityEvent> GetAllKeywords();
Also, there are several life cycle methods we can override coming from KeywordRecognitionSubsystem.Provider - methods that are largely the same as in a behaviour.
In the Start method override, I have implemented some code to initialize the voice recognition. Attentive readers will notice that this - and a lot of the following code - is basically a modified version of the runtime voice intents sample on the Magic Leap developer docs, with a few additions by me to take care of some idiosyncrasies I ran into.
[Preserve]
internal class MagicLeapKeywordRecognitionProvider :
KeywordRecognitionSubsystem.Provider
{
private int commandId = 0;
private MLVoiceIntentsConfiguration voiceConfiguration;
public override void Start()
{
base.Start();
if (voiceConfiguration == null)
{
voiceConfiguration =
ScriptableObject.CreateInstance<MLVoiceIntentsConfiguration>();
voiceConfiguration.VoiceCommandsToAdd =
new List<MLVoiceIntentsConfiguration.CustomVoiceIntents>();
voiceConfiguration.AllVoiceIntents =
new List<MLVoiceIntentsConfiguration.JSONData>();
voiceConfiguration.SlotsForVoiceCommands =
new List<MLVoiceIntentsConfiguration.SlotData>();
}
if (!running)
{
MLVoice.OnVoiceEvent += OnVoiceEvent;
}
}
}
The line voiceConfiguration.SlotsForVoiceCommands = … is one of those additions that proved to be necessary - if I didn’t add that, I got a null reference error. Note that running is a read-only base class property that is set internally.
You can see that the keyword and its event are both added to the internal dictionary, and an ‘intent’ is added to the Magic Leap API. GetAllKeywords simply returns the keyword/event dictionary:
public override UnityEvent CreateOrGetEventForKeyword(string keyword)
{
if (!keywordDictionary.ContainsKey(keyword))
{
keywordDictionary.Add(keyword, new UnityEvent());
AddIntentForKeyword(keyword);
SetupVoiceIntents();
}
return keywordDictionary[keyword];
}
public override IReadOnlyDictionary<string, UnityEvent> GetAllKeywords()
{
return keywordDictionary;
}
keywordDictionary, by the way, is once again a base class property. It is a simple Dictionary<string, UnityEvent>.
Here the same pattern: remove from the internal dictionary, remove intent. Oh, and disconnect any listeners from the event before we toss it out.
public override void RemoveKeyword(string keyword)
{
if(keywordDictionary.TryGetValue(keyword, out var eventToRemove))
{
eventToRemove.RemoveAllListeners();
keywordDictionary.Remove(keyword);
voiceConfiguration.AllVoiceIntents.Remove(
voiceConfiguration.AllVoiceIntents.First(k=> k.value == keyword));
SetupVoiceIntents();
}
}
public override void RemoveAllKeywords()
{
foreach( var eventToRemove in keywordDictionary.Values)
{
eventToRemove.RemoveAllListeners();
}
keywordDictionary.Clear();
voiceConfiguration.AllVoiceIntents.Clear();
SetupVoiceIntents();
}
If the subsystem is halted or destroyed, we need to handle things in the Stop and Destroy life cycle methods, which are now pretty easy to write:
public override void Stop()
{
base.Stop();
MLVoice.OnVoiceEvent -= OnVoiceEvent;
}
public override void Destroy()
{
base.Destroy();
RemoveAllKeywords();
Stop();
}
Now we have set up the framework, there are some little implementation details left. In the Start method, we connected the OnVoiceEvent method to the MLVoice.OnVoiceEvent event. The implementation is pretty simple: we check if a keyword is recognized and, if so, find the event belonging to it - and then invoke that event, notifying possible external listeners.
private void OnVoiceEvent(in bool wasSuccessful,
in MLVoice.IntentEvent voiceEvent)
{
if (wasSuccessful)
{
if (keywordDictionary.TryGetValue(voiceEvent.EventName,
out var value))
{
value?.Invoke();
}
}
}
AddIntentForKeyword creates the actual intent. The commandId needs to be unique, so we simply use an incrementing integer:
private void AddIntentForKeyword(string keyword)
{
var newIntent = new MLVoiceIntentsConfiguration.CustomVoiceIntents
{
Value = keyword,
Id = (uint)commandId++
};
voiceConfiguration.VoiceCommandsToAdd.Add(newIntent);
}
The last method is quite weird. In all methods, you can see that after adding or removing intents, the method SetupVoiceIntents is called. This is because, for any change of intents to be recognized, MLVoice.SetupVoiceIntents needs to be called. At least, that seems to be the case. Here’s another idiosyncrasy I found: if an MLVoiceIntentsConfiguration contains zero intents, MLVoice.SetupVoiceIntents throws an exception. So if it’s empty, I make sure there is at least one dummy intent - without an event:
private void SetupVoiceIntents()
{
if (!voiceConfiguration.AllVoiceIntents.Any())
{
AddIntentForKeyword("dummyxyznotempty");
}
MLVoice.SetupVoiceIntents(voiceConfiguration);
}
To show it actually works, I have added a small piece of demo code. If you have configured the MagicLeapKeywordRecognitionSubsystem as your keyword recognition subsystem in the MRTK3 settings and deploy it to the Magic Leap, you will see this user interface:
By default, it recognizes three phrases. You can add a “Hello there”, remove it again, remove all, restore the initial commands, and toggle on/off the whole recognizer. Adding commands goes like this:
public void InitStandardPhrases()
{
RemoveAll();
keywordRecognitionSubsystem.CreateOrGetEventForKeyword("Good morning").
AddListener(() => ShowRecognizedCommand("Good morning"));
keywordRecognitionSubsystem.CreateOrGetEventForKeyword("Nice weather").
AddListener(() => ShowRecognizedCommand("Nice weather"));
keywordRecognitionSubsystem.CreateOrGetEventForKeyword("Mixed Reality is cool").
AddListener(() => ShowRecognizedCommand("Mixed Reality is cool"));
UpdateRecognizedCommands();
}
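For completeness: the keywordRecognitionSubsystem field used here can be retrieved the usual MRTK3 way - a sketch, assuming the subsystem is enabled in your MRTK profile:
keywordRecognitionSubsystem =
    XRSubsystemHelpers.GetFirstRunningSubsystem<KeywordRecognitionSubsystem>();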
Removing keywords goes like this:
public void RemoveHello()
{
keywordRecognitionSubsystem.RemoveKeyword("Hello there");
UpdateRecognizedCommands();
}
public void RemoveAll()
{
keywordRecognitionSubsystem.RemoveAllKeywords();
UpdateRecognizedCommands();
}
and controlling the keyword recognizer like this:
public void ToggleKeywordRecognition()
{
if (keywordRecognitionSubsystem.running)
{
keywordRecognitionSubsystem.Stop();
}
else
{
keywordRecognitionSubsystem.Start();
}
}
Using this keyword recognizer, you can use exactly the same MRTK3 API for creating and responding to keywords as you were used to on HoloLens with the WindowsKeywordRecognitionSubsystem. The Magic Leap API is neatly encapsulated, and as far as speech control for your app goes, it doesn’t matter whether it’s running on a HoloLens 2 or a Magic Leap 2. As a consequence, Augmedit Lumi now supports speech recognition on Magic Leap - without any code change for that.
When I started developing for Quest and Magic Leap 2 as well, I noticed no such file was available. When I was experimenting with getting Lumi - Augmedit’s flagship product for brain surgery - to run on the Magic Leap 2, there were some things that only occasionally went wrong, and only at run time. How helpful would it be if that app also dumped its log messages in a file, so I could check after the fact? Indeed. And so I wrote a little ServiceFramework service that does exactly that.
The service sports a profile that allows you to set a few properties: whether logging should start automatically (AutoStart), the log file prefix (LogFilePrefix), how many days of log files to retain (RetainDays), which log types to trap (LogTypes), and phrases to filter out (FilterPhrases). This can all be conveniently set via the inspector.
The service sports only two methods, which you don’t actually even need to use - provided you set the AutoStart profile property to true. But hey, you can do it all yourself if you want:
public interface IFileLoggerService : IService
{
public void StartLogging();
public void StopLogging();
}
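If you don’t use AutoStart, controlling the service yourself is a matter of the Service Framework’s standard retrieval pattern:
var logger = ServiceManager.Instance.GetService<IFileLoggerService>();
logger.StartLogging();
// ... and when you are done:
logger.StopLogging();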
It’s a bog-standard ServiceFramework Service. Messages are dumped in a queue whenever they arrive, and written to the log file one by one by the Update method, to prevent overloading the system with write actions - and to keep the messages in order.
public class FileLoggerService : BaseServiceWithConstructor, IFileLoggerService
{
private readonly string nl = Environment.NewLine;
private readonly FileLoggerServiceProfile serviceProfile;
private StreamWriter currentLogFile = null;
private readonly Queue<string> logMessages = new();
public FileLoggerService(string name, uint priority,
FileLoggerServiceProfile profile)
: base(name, priority)
{
serviceProfile = profile;
}
}
nl is just shorthand for Environment.NewLine, which we will see again later. Next up is the basic startup/shutdown logic, which really doesn’t hold many surprises:
public override void Start()
{
if (serviceProfile.AutoStart)
{
StartLogging();
}
}
public override void Destroy()
{
StopLogging();
base.Destroy();
}
public void StartLogging()
{
if (currentLogFile == null)
{
Application.logMessageReceivedThreaded += LogListener;
}
}
public void StopLogging()
{
Application.logMessageReceivedThreaded -= LogListener;
if (currentLogFile != null)
{
currentLogFile.Close();
currentLogFile = null;
}
}
It also shows the crux of the whole story: the event Application.logMessageReceivedThreaded is used to intercept everything Unity wants to log - this is a rather standard trick - and pass it to the LogListener method. This method formats any log message that is not filtered either by phrase or type, and dumps the result in the logMessages queue…
private void LogListener(string logString, string stacktrace, LogType lType)
{
if ( ShouldLog(logString, lType))
{
logMessages.Enqueue(GetLogString(logString, stacktrace, lType));
}
}
private string GetLogString(string message, string stacktrace, LogType lType)
{
var timeStamp = DateTimeOffset.UtcNow.ToString("yyyy-MM-dd HH:mm:ss.fff");
var trace = !string.IsNullOrEmpty(stacktrace) ?
$"StackTrace: {stacktrace}{nl}" : string.Empty;
return
$"Time: {timeStamp}{nl}Log: {lType}{nl}Msg: {message}{nl}{trace}====={nl}";
}
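A single entry in the resulting log file then looks something like this (timestamp and message invented for the example):
Time: 2023-12-01 12:34:56.789
Log: Error
Msg: Something went wrong
StackTrace: ...
=====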
… and like I wrote before, those log messages are written to the log file by the Update method, which is called once every frame by the Service Framework.
public override void Update()
{
base.Update();
if (logMessages.Any())
{
LogFile.WriteLine(logMessages.Dequeue());
}
}
The LogFile property creates an auto-flushing StreamWriter or returns the existing one for append:
private StreamWriter LogFile
{
get
{
if (currentLogFile == null)
{
var logFilePath = Path.Combine(Application.persistentDataPath,
$"{serviceProfile.LogFilePrefix}{DateTimeOffset.UtcNow:yyyyMMddHHmmss}.log");
currentLogFile = new StreamWriter(
new FileStream(logFilePath, FileMode.Append, FileAccess.Write));
currentLogFile.AutoFlush = true;
}
return currentLogFile;
}
}
Cleaning up the old log files, as we saw being called from the Initialize method, is simply done thus:
private void DeleteOldLogs()
{
foreach (var file in GetOldLogFilesToDelete())
{
try
{
File.Delete(file);
}
catch (Exception e)
{
Debug.LogError($"Log {file} could not be deleted: {e}");
}
}
}
private IEnumerable<string> GetOldLogFilesToDelete()
{
return Directory.GetFiles(Application.persistentDataPath,
$"{serviceProfile.LogFilePrefix}*.log").Where(file =>
Math.Abs((DateTimeOffset.Now - File.GetCreationTime(file)).Days) >
serviceProfile.RetainDays);
}
All files older than RetainDays are deleted. Or at least, we try to - and if that fails, it will also be logged in the current log file. How meta ;)
The only thing left then is the filtering, in which we see some simple keyword filtering and some very funky bit shifting stuff.
private bool ShouldLog(string logString, LogType lType)
{
var logTypeFlagInt = 1 << (int)lType;
return ((int)serviceProfile.LogTypes & logTypeFlagInt) != 0 &&
!serviceProfile.FilterPhrases.Any(logString.Contains);
}
So what is the deal with that? Well, Unity has defined the log levels as such in an Enum:
namespace UnityEngine
{
public enum LogType
{
Error,
Assert,
Warning,
Log,
Exception,
}
}
The issue with that is that if you define a property of type LogType, you can only select one value at a time in the inspector, so you could only trap one type of log. What I wanted was to be able to choose any combination of log types.
So I defined my own Enum
[Flags]
public enum LogTypeFlags
{
Error = 1,
Assert = 1 << 1,
Warning = 1 << 2,
Log = 1 << 3,
Exception = 1 << 4
}
And using the fact that an Enum can be cast to an int, and bit shifting a 1 by the int value of LogType, I can compare my own Enum with the Unity one. This, of course, entirely depends on the order of the Unity Enum and the assumption (or hope, whatever you want to call it) that they will never change it.
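To make the bit math concrete, following the two enum definitions above:
// LogType.Warning == 2, so ShouldLog computes logTypeFlagInt = 1 << 2 = 4,
// which is exactly LogTypeFlags.Warning.
// With LogTypes = LogTypeFlags.Error | LogTypeFlags.Exception (1 | 16 = 17):
//   a Warning    gives 17 & 4  == 0 -> filtered out
//   an Exception gives 17 & 16 != 0 -> logged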
If you connect your Magic Leap 2, Quest or other Android device, you can now simply find the logs. Every session gets its own log file. On Magic Leap 2, you can find them at
Internal shared storage\Android\data\[your.package.name.com]\files
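From a PC, you can also fetch them over ADB; substitute your own package name:
adb pull /sdcard/Android/data/your.package.name.com/files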
I have created a little demo project that shows how it works, using an extremely lame piece of demo code:
public class DemoLogger : MonoBehaviour
{
private int logFrameCount;
void Update()
{
if (logFrameCount++ % 240 == 0)
{
Debug.Log($"DemoLogger Update {logFrameCount}");
}
}
}
It basically dumps a debug message in the log every 240 frames (which is about 4 seconds). This must be about the least visually compelling demo I ever created, as it shows absolutely nothing. The only effect is the appearance of log files ;). After a run, you will see log files appear on the device, in this case Magic Leap 2:
This service is very useful for finding those weird ‘Heisenbugs’ that sometimes occur. It is actually useful on HoloLens 2 as well - because how often does a user restart the app after something has gone wrong, thereby immediately overwriting the standard log file? Of course, you can also use Application Insights, AppCenter logging or some other remote logging service - but this always works, connection or not, is simple to use, and has very little impact. Be aware, however, that excessive logging can have a performance impact of its own.
Anyway, as indicated before, all code involved can be found here.
So I took a look at the repo from my blog post about this from June 2022 - that was still on some old MRTK3 pre-release - and first updated it. Then I added the requested functionality. The way to do it turns out to be very simple. First, you have to get a reference to a hands aggregator subsystem:
handsAggregatorSubsystem =
XRSubsystemHelpers.GetFirstRunningSubsystem<IHandsAggregatorSubsystem>();
And then you get the actual pinch position like this:
private Vector3 GetPinchPosition(ArticulatedHandController handController)
{
return handsAggregatorSubsystem.TryGetPinchingPoint(handController.HandNode,
out var jointPose)
? jointPose.Position
: Vector3.zero;
}
The only piece of the puzzle you then need is finding the ArticulatedHandController objects for the left and right hand, which you can do using my updated MRTK3ConfigurationFindingService.
All things combined, you can use it like this:
findingService =
ServiceManager.Instance.GetService<IMRTK3ConfigurationFindingService>();
findingService.LeftHandStatusTriggered.AddListener(t =>
{
leftHand.SetActive(t);
if (t)
{
textMesh.text = $"Left hand position: {GetPinchPosition(findingService.LeftHand)}";
}
});
findingService.RightHandStatusTriggered.AddListener(t =>
{
rightHand.SetActive(t);
if (t)
{
textMesh.text = $"Right hand position: {GetPinchPosition(findingService.RightHand)}";
}
});
It still works like before, on HoloLens 2 and Quest, but it now also shows, in text, the position at which you actually air tap, and with which hand.
You can find the updated code and new functions in this repo, branch crossplatairtap-position
However, this was … quite a venture.
When I wrote about using the Spatial Map on Magic Leap 2, I was surprised to learn Magic Leap hadn’t implemented ARMeshManager. However, a simple behaviour filled that void. That was nothing compared to what I ran into when I wanted to do something I assumed to be easy: capturing the camera image. After all, I needed it to feed the model. On HoloLens 2, all you need to do to get a webcam view is this:
var webCamTexture = new WebCamTexture(requestCameraSize.x, requestCameraSize.y,
cameraFPS);
webCamTexture.Play();
When I tried this on Magic Leap 2, nothing happened. I could actually retrieve cameras and camera sizes using the WebCamTexture API, but I did not get any image. After conferring with a Magic Leap engineer, I got more or less the same message as with the Spatial Map: this part of the Unity stack is not implemented (yet), so I had to use Magic Leap specific code here as well. He was kind enough to point me to the “Simple Camera Example” in the Magic Leap 2 developer docs. The name is a bit misleading, because there are actually two samples there; I used the simpler of the two. That ‘Simple Camera Example’ is 300 lines long.
I repeat: that ‘Simple Camera Example’ is 300 lines long.
It would also require me to completely rewrite the logic of the app. This, of course, would not do.
I am not a software architect just for the showy name, so I put on my ‘architect hat’ and called the Reality Collective Service Framework to the rescue. If there are two or more very different APIs aiming to achieve basically the same goal, I try to define a service that handles those differences. Instead of relying on either WebCamTexture or the Magic Leap camera API directly, I made myself an Image Acquiring Service: one interface, with two implementations - one for HoloLens 2, and one for Magic Leap 2.
The interface is hilariously simple:
public interface IImageAcquiringService : IService
{
void Initialize(Vector2Int requestedImageSize);
Vector2Int ActualCameraSize { get; }
Task<Texture2D> GetImage();
}
The main YoloObjectLabeler behaviour first held all the settings and did the image processing; now the settings have all moved to service profiles, and it just gets a reference to an IImageAcquiringService, sends it the Yolo model image size, reads the actual provided camera size, and then repeatedly calls GetImage() to get the latest image.
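In code, that boils down to something like this (a sketch; the service retrieval is standard Service Framework fare, but the surrounding code is condensed for illustration):
// Inside some async method of the hypothetical consumer
var imageService = ServiceManager.Instance.GetService<IImageAcquiringService>();
// Tell the service the Yolo model's input size
imageService.Initialize(new Vector2Int(320, 256));
// What the camera actually provides may differ from what was requested
var cameraSize = imageService.ActualCameraSize;
// Then, once per recognition cycle:
var image = await imageService.GetImage();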
To be able to give every implementation its own settings, I have defined a profile with the following settings:
public class ImageAcquiringServiceProfile : BaseServiceProfile<IServiceModule>
{
[SerializeField]
private int cameraFPS = 4;
[SerializeField]
private Vector2Int requestedCameraSize = new(896, 504);
public int CameraFPS => cameraFPS;
public Vector2Int RequestedCameraSize => requestedCameraSize;
}
The FPS setting is only functional on HoloLens 2 (or any other platform that implements WebCamTexture). The default value shown here is for HoloLens 2. Magic Leap 2 actually provides many more camera sizes; there I have set it to 640x480, as this is closest to the 320x256 the Yolo V8 model in the app uses.
Although this post is about running it on Magic Leap 2, I wanted to show you what a WebCamTexture-based implementation looks like, if only to show how simple and clear it is:
public class ImageAcquiringService : BaseServiceWithConstructor, IImageAcquiringService
{
private readonly ImageAcquiringServiceProfile profile;
private WebCamTexture webCamTexture;
private RenderTexture renderTexture;
public ImageAcquiringService(string name, uint priority,
ImageAcquiringServiceProfile profile)
: base(name, priority)
{
this.profile = profile;
}
public void Initialize(Vector2Int requestedImageSize)
{
    renderTexture = new RenderTexture(requestedImageSize.x, requestedImageSize.y,
        24);
    // Note: this assumes Start has already run (the Service Framework starts
    // services before app code calls Initialize), so webCamTexture exists here
    webCamTexture.Play();
}
public override void Start()
{
webCamTexture = new WebCamTexture(profile.RequestedCameraSize.x,
profile.RequestedCameraSize.y, profile.CameraFPS);
ActualCameraSize = new Vector2Int(webCamTexture.width, webCamTexture.height);
}
public Vector2Int ActualCameraSize { get; private set; }
public async Task<Texture2D> GetImage()
{
if (renderTexture == null)
{
return null;
}
Graphics.Blit(webCamTexture, renderTexture);
await Task.Delay(32);
var texture = renderTexture.ToTexture2D();
return texture;
}
}
Start is called by the Service Framework; it creates the WebCamTexture with the requested size, then retrieves the actual size (this might differ - I can ask a HoloLens for 640x480, but I won’t get it; it will default to the closest supported camera size).
Initialize sets the RenderTexture’s initial size (320x256 for this model) and starts generating images.
And then, by calling GetImage(), I can get the latest image from the WebCamTexture. Mind you, this generates a new Texture2D every time; it’s the caller’s responsibility to destroy it when done (this was already the case in the previous sample, but then it all happened in one class).
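To make that explicit, calling code should look something like this sketch, or it will leak one texture per call:
var texture = await imageService.GetImage();
if (texture != null)
{
    // ... feed the texture to the model ...
    // The caller owns the texture, so clean it up afterwards
    UnityEngine.Object.Destroy(texture);
}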
I will limit myself to a few snippets, as the complete service is 267 lines long. I have basically taken the “Simple Camera Example”, removed a lot of the global variables, and changed them into parameters. At the top of the service, it says this:
public void Initialize(Vector2Int requestedImageSize)
{
this.requestedImageSize = requestedImageSize;
}
public override void Start()
{
StartCameraCapture();
}
Basically the same as the HoloLens 2 implementation, only it now fires off a lot of Magic Leap specific code instead of a WebCamTexture.Play(). This kicks off a whole batch of things. As far as I can see:
- StartCameraCapture eventually connects the handler OnCaptureRawVideoFrameAvailable to the OnRawVideoFrameAvailable event
- OnCaptureRawVideoFrameAvailable then calls UpdateRGBTexture
- UpdateRGBTexture fills the Texture2D “videoTexture”, which is about the only global variable left. It also sets the ActualCameraSize (this is available only after the first frame).
The GetImage implementation for Magic Leap 2 looks remarkably like that of the HoloLens 2:
public async Task<Texture2D> GetImage()
{
if (videoTexture == null)
{
return null;
}
if (renderTexture == null)
{
renderTexture = new RenderTexture(requestedImageSize.x, requestedImageSize.y, 24);
}
Graphics.Blit(videoTexture, renderTexture);
await Task.Delay(32);
return FlipTextureVertically(renderTexture.ToTexture2D());
}
… but for one crucial detail: the image is flipped upside down.
So there’s this final method that takes care of that by flipping it back. I nicked it off StackOverflow, of course:
private static Texture2D FlipTextureVertically(Texture2D original)
{
var originalPixels = original.GetPixels();
var newPixels = new Color[originalPixels.Length];
var width = original.width;
var rows = original.height;
// Copy each pixel from the mirrored row, so the top row becomes the bottom row
for (var x = 0; x < width; x++)
{
for (var y = 0; y < rows; y++)
{
newPixels[x + y * width] = originalPixels[x + (rows - y - 1) * width];
}
}
original.SetPixels(newPixels);
original.Apply();
return original;
}
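As an aside, the same flip could presumably be done on the GPU during the blit itself, avoiding the per-pixel CPU copy. A sketch, using the scale/offset overload of Graphics.Blit (I have not verified this on Magic Leap 2):
// A negative vertical scale plus an offset of 1 makes the blit itself
// flip the image vertically, without touching pixels on the CPU
Graphics.Blit(videoTexture, renderTexture, new Vector2(1, -1), new Vector2(0, 1));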
Apart from the refactoring of the image acquisition into a service, the app itself is largely unchanged. So how does it work in real life on the Magic Leap 2? Well, it recognizes the airplanes, more or less in the right place, just like HoloLens 2. It also recognizes the DC3 Dakota, even though I never added a picture of that to the training set - but I guess that is more a Unity Barracuda accomplishment than a Magic Leap 2 one. As far as performance goes, I would say it is about on par with HoloLens 2. This surprised me, as I expected it to be faster, having a beefier processor, but it actually seems to be just that bit slower and have a lower recognition rate than HoloLens 2.
Don’t get me wrong - there’s definitely room for improvement here - in my code, that is. I will not even begin to pretend I understand this device as well as I do HoloLens 2. I notice the image I get is a bit grainier than what HoloLens 2 delivers. Also, the FOV of the webcam seems to be bigger, which gives some more distortion at the edges - the 3D object and the image no longer exactly overlap, and therefore my rather crude approach of locating objects in 3D space based upon their location in a 2D image doesn’t work as well anymore.
I am excited to see Magic Leap 2 can do computer vision this way as well, which in light of the current AI wave is a very important capability. In this regard, I think it would be nice if the Magic Leap SDK would implement WebCamTexture as well. However, since this is a bit of an unusual use case, I can also imagine it not having the highest priority.
A branch of the YoloHolo for Magic Leap 2 can be downloaded here