# Local Chatbot on Android with ONNX Runtime Mobile and ONNX Runtime Generate() API

## Overview
This is a flexible Android chatbot application built with [ONNX Runtime mobile](https://onnxruntime.ai/docs/tutorials/mobile/) and the [ONNX Runtime Generate() API](https://github.com/microsoft/onnxruntime-genai), with support for running generative AI models efficiently on device. While it uses [Phi-3](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct) by default, **it can work with any ONNX Runtime GenAI compatible model** by simply updating the model configuration in the code.
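
For orientation, here is a rough sketch of what driving a model through the Generate() API can look like from Java. It is not the app's actual code: the class and method names (`SimpleGenAI`, `createGeneratorParams`, `generate`) follow the ONNX Runtime GenAI Java bindings and should be checked against the binding version you build with, and the prompt template shown is the Phi-3 chat format, which other models may not use.

```java
import java.util.function.Consumer;

import ai.onnxruntime.genai.GenAIException;
import ai.onnxruntime.genai.GeneratorParams;
import ai.onnxruntime.genai.SimpleGenAI;

public class ChatSketch {
    public static void main(String[] args) throws GenAIException {
        // Point at the directory containing the ONNX model and genai_config.json.
        SimpleGenAI genAI = new SimpleGenAI("/path/to/model/dir");

        // Phi-3 chat prompt format; adjust for other models.
        String prompt = "<|user|>\nWhat is an ONNX model?<|end|>\n<|assistant|>";

        GeneratorParams params = genAI.createGeneratorParams(prompt);
        params.setSearchOption("max_length", 256);

        // Stream tokens to stdout as they are generated, then print the full answer.
        Consumer<String> tokenListener = System.out::print;
        String answer = genAI.generate(params, tokenListener);
        System.out.println("\n---\n" + answer);
    }
}
```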

### Model

By default, this app uses the [ONNX Phi-3 model on HuggingFace](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct-onnx/tree/main/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4) with INT4 quantization and optimizations for mobile usage.

You can also optimize your own fine-tuned PyTorch Phi-3 model for mobile usage by following this example: [Phi-3 optimization with Olive](https://github.com/microsoft/Olive/tree/main/examples/phi3).

### Using Different Models

**The app is designed to work with any ONNX Runtime GenAI compatible model.** To use a different model:

1. Open `MainActivity.java` in Android Studio
2. Locate the model configuration section at the top of the class (marked with comments)
3. Update the `MODEL_BASE_URL` to point to your model's download location
4. Update the `MODEL_FILES` list to include all required files for your model

Example for a different model (the URL and file names below are placeholders that illustrate the shape of the configuration, not a real model location):

```java
// Base URL for downloading model files (ensure it ends with '/')
private static final String MODEL_BASE_URL =
        "https://huggingface.co/<your-org>/<your-model>/resolve/main/";

// All files the app needs to download for this model (adjust the names to match your model)
private static final List<String> MODEL_FILES = Arrays.asList(
        "genai_config.json",
        "model.onnx",
        "model.onnx.data",
        "tokenizer.json",
        "tokenizer_config.json");
```

**Note:** The model files will be downloaded to `/data/data/ai.onnxruntime.genai.demo/files` on the Android device.
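
For reference, a download loop that puts the files in that directory can be as simple as the sketch below. This is illustrative only (the app's actual downloader may differ); it assumes the `MODEL_BASE_URL` and `MODEL_FILES` values described above, and it must run off the main thread, since Android forbids network I/O on it.

```java
import android.content.Context;

import java.io.File;
import java.io.FileOutputStream;
import java.io.InputStream;
import java.net.HttpURLConnection;
import java.net.URL;
import java.util.List;

public final class ModelDownloader {
    /** Downloads every entry in modelFiles from baseUrl into the app's files directory. */
    public static void downloadAll(Context context, String baseUrl, List<String> modelFiles)
            throws Exception {
        for (String name : modelFiles) {
            // Resolves to /data/data/<package>/files/<name> on the device.
            File target = new File(context.getFilesDir(), name);
            if (target.exists()) {
                continue; // already downloaded, or copied manually
            }
            HttpURLConnection conn = (HttpURLConnection) new URL(baseUrl + name).openConnection();
            try (InputStream in = conn.getInputStream();
                 FileOutputStream out = new FileOutputStream(target)) {
                byte[] buffer = new byte[8192];
                int read;
                while ((read = in.read(buffer)) != -1) {
                    out.write(buffer, 0, read);
                }
            } finally {
                conn.disconnect();
            }
        }
    }
}
```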
### Requirements
- Android Studio Giraffe | 2022.3.1 or later (installed on Mac/Windows/Linux)

The current setup supports downloading the Phi-3-mini model directly from HuggingFace.

You can also follow this link to download **Phi-3-mini**: https://huggingface.co/microsoft/Phi-3-mini-4k-instruct-onnx/tree/main/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4 and manually copy it to the Android device file directory following the instructions below:

#### Steps for manually copying model files to the Android device directory:
From Android Studio:
- create (if necessary) and run your emulator/device
  - make sure it has at least 8GB of internal storage
- Open Device Explorer in Android Studio
- Navigate to `/data/data/ai.onnxruntime.genai.demo/files`
  - adjust as needed if the value returned by `getFilesDir()` differs for your emulator or device (see the check snippet after this list)
- copy all the required model files (as specified in `MODEL_FILES` in MainActivity.java) directly to the `files` directory
  - For the default Phi-3 model, copy the files from [here](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct-onnx/tree/main/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4)
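
To confirm the files landed where the app expects them, a small check like the one below can help. It assumes the `MODEL_FILES` list from `MainActivity.java`; the helper class and log tag are made up for this example. It logs the actual `getFilesDir()` path and flags anything missing:

```java
import android.content.Context;
import android.util.Log;

import java.io.File;
import java.util.List;

public final class ModelFileCheck {
    private static final String TAG = "ModelFileCheck";

    /** Logs the app's files directory and whether each expected model file is present. */
    public static boolean allFilesPresent(Context context, List<String> modelFiles) {
        File filesDir = context.getFilesDir();
        Log.i(TAG, "getFilesDir() = " + filesDir.getAbsolutePath());

        boolean allPresent = true;
        for (String name : modelFiles) {
            File f = new File(filesDir, name);
            Log.i(TAG, name + (f.exists() ? " found (" + f.length() + " bytes)" : " MISSING"));
            allPresent &= f.exists();
        }
        return allPresent;
    }
}
```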
### Step 3: Connect Android Device and Run the app
Connect your Android Device to your computer or select the Android Emulator in Android Studio Device manager.