Merge pull request #295 from ACEsuit/asp

cortner · web-flow · commit a19c05973275 · 2025-09-26T10:33:46.000-07:00
ASP tutorial
diff --git a/docs/make.jl b/docs/make.jl
@@ -28,7 +28,8 @@ Literate.markdown(_tutorial_src * "/dataset_analysis.jl",
 Literate.markdown(_tutorial_src * "/descriptor.jl",
                   _tutorial_out; documenter = true)
 
-
+Literate.markdown(_tutorial_src * "/asp.jl", 
+                  _tutorial_out; documenter = true)
 # Literate.markdown(_tutorial_src * "/first_example_model.jl", 
 #                   _tutorial_out; documenter = true)
 
@@ -71,6 +72,7 @@ makedocs(;
                 "literate_tutorials/dataset_analysis.md",
                 "tutorials/scripting.md", 
                 "literate_tutorials/descriptor.md",
+                "literate_tutorials/asp.md",
             ],
         "Additional Topics" => Any[
             "gettingstarted/parallel-fitting.md",
diff --git a/docs/src/tutorials/asp.jl b/docs/src/tutorials/asp.jl
@@ -0,0 +1,117 @@
+# # Sparse Solvers
+#
+# This short tutorial introduces the use of the Lasso Homotopy (ASP) and Orthogonal Matching Pursuit (OMP) solvers.
+# These are sparse solvers that compute the entire regularization path, 
+# providing insight into how the support evolves as the regularization parameter changes.
+# For more details on the algorithms and their implementation,
+# see [ActiveSetPursuit.jl](https://github.com/MPF-Optimization-Laboratory/ActiveSetPursuit.jl)
+
+# We start by importing `ACEpotentials` (and possibly other required libraries)
+using ACEpotentials
+using Random, Plots
+using ACEpotentials.Models: fast_evaluator
+using SparseArrays
+using Plots
+
+
+# Since sparse solvers automatically select the most relevant features, we usually begin with a model that has a large basis.
+# Here, for demonstration purposes, we use a relatively small model.
+
+model = ace1_model(elements = [:Si], order = 3, totaldegree = 12)
+P = algebraic_smoothness_prior(model; p = 4)
+
+# Next, we load a dataset. We split the dataset into training, validation, and test sets.
+# The training set is used to compute the solution path, the validation set is used to select the best solution, and the test set is used to evaluate the final model.
+
+_train_data, test_data, _ = ACEpotentials.example_dataset("Zuo20_Si")
+shuffle!(_train_data); 
+_train_data = _train_data[1:100]  # Limit the dataset size for this tutorial
+isplit = floor(Int, 0.8 * length(_train_data))
+train_data = _train_data[1:isplit] 
+val_data = _train_data[isplit+1:end]
+
+# We can now assemble the linear system for the training and validation sets.
+
+At, yt, Wt = ACEpotentials.assemble(train_data, model);
+Av, yv, Wv = ACEpotentials.assemble(val_data, model);
+
+# We can now compute sparse solution paths using the `ASP` and `OMP` solvers.
+# These solvers support customizable selection criteria for choosing a solution along the path.
+#
+# The `select` keyword controls which solution is returned:
+# - `:final` selects the final iterate on the path.
+# - `(:bysize, n)` selects the solution with exactly `n` active parameters.
+# - `(:byerror, ε)` selects the smallest solution whose validation error is within a factor `ε` of the minimum validation error.
+
+# The `tsvd` keyword controls whether the solution is post-processed using truncated SVD.
+# This is often beneficial for `ASP`, as ℓ1-regularization can shrink coefficients toward zero too aggressively.
+
+# The `actMax` keyword controls the maximum number of active parameters in the solution. 
+
+solver_asp = ACEfit.ASP(; P = P, select = :final, tsvd = true, actMax = 100,  loglevel = 0);
+asp_result = ACEfit.solve(solver_asp, Wt .* At, Wt .* yt, Wv .* Av, Wv .* yv);
+
+
+# We can also compute the OMP path, which is a greedy algorithm that selects the most relevant features iteratively.
+
+solver_omp = ACEfit.OMP(; P = P, select = :final, tsvd = false, actMax = 100, loglevel = 0);
+omp_result = ACEfit.solve(solver_omp, Wt .* At, Wt .* yt, Wv .* Av, Wv .* yv);
+
+
+# To demonstrate the use of the sparse solvers, we will generate models with different numbers of active parameters.
+# We can select the final model, a model with 500 active parameters, and a model with a validation error within 1.3 times the minimum validation error.
+# We can use the `ACEfit.asp_select` function to select the desired models from the result.
+
+asp_final = set_parameters!( deepcopy(model), 
+                  ACEfit.asp_select(asp_result, :final)[1]);
+asp_size_50  = set_parameters!( deepcopy(model), 
+                  ACEfit.asp_select(asp_result, (:bysize, 50))[1]);
+asp_error13  = set_parameters!( deepcopy(model), 
+                  ACEfit.asp_select(asp_result, (:byerror, 1.3))[1]);
+
+pot_final = fast_evaluator(asp_final; aa_static = false);
+pot_50 = fast_evaluator(asp_size_50; aa_static = true);
+pot_13 = fast_evaluator(asp_error13; aa_static = true);
+
+err_13 = ACEpotentials.compute_errors(test_data,  pot_13);
+err_50 = ACEpotentials.compute_errors(test_data,  pot_50);
+err_fin = ACEpotentials.compute_errors(test_data, pot_final);
+
+
+# Similarly, we can compute the errors for the OMP models.
+
+omp_final = set_parameters!( deepcopy(model), 
+                  ACEfit.asp_select(omp_result, :final)[1]);
+omp_50  = set_parameters!( deepcopy(model), 
+                  ACEfit.asp_select(omp_result, (:bysize, 50))[1]);
+omp_13  = set_parameters!( deepcopy(model), 
+                  ACEfit.asp_select(omp_result, (:byerror, 1.3))[1]);
+
+pot_fin = fast_evaluator(omp_final; aa_static = false);
+pot_50 = fast_evaluator(omp_50; aa_static = true);
+pot_13 = fast_evaluator(omp_13; aa_static = true);
+
+err_13 = ACEpotentials.compute_errors(test_data,  pot_13);
+err_50 = ACEpotentials.compute_errors(test_data,  pot_50);
+err_fin = ACEpotentials.compute_errors(test_data, pot_fin);
+
+
+# Finally, we can visualize the results along the solution path.
+# We plot the validation error as a function of the number of active parameters for both ASP and OMP.
+
+path_asp = asp_result["path"];
+path_omp = omp_result["path"];
+
+nz_counts_asp = [nnz(p.solution) for p in path_asp];
+nz_counts_omp = [nnz(p.solution) for p in path_omp];
+
+rmses_asp = [p.rmse for p in path_asp];
+rmses_omp = [p.rmse for p in path_omp];
+
+plot(nz_counts_asp, rmses_asp;
+     xlabel = "# Nonzero Coefficients",
+     ylabel = "RMSE",
+     title = "RMSE vs Sparsity Level",
+     marker = :o,
+     grid = true, yscale = :log10, label = "ASP")
+plot!(nz_counts_omp, rmses_omp; marker = :o, label = "OMP")
diff --git a/docs/src/tutorials/index.md b/docs/src/tutorials/index.md
@@ -6,4 +6,6 @@
 * [Smoothness Priors](../literate_tutorials/smoothness_priors.md) : brief introduction to smoothness priors
 * [Basic Dataset Analysis](../literate_tutorials/dataset_analysis.md) : basic techniques to visualize training datasets and correlate such observations to the choice of geometric priors
 * [Descriptors](../literate_tutorials/descriptor.md) : `ACEpotentials` can be used as descriptors of atomic environments or structures, which is described here. 
+* [Sparse Solvers](../literate_tutorials/asp.md) : basic tutorial on using the `ASP` and `OMP` solvers.
+