使用 nls() 进行非线性拟合在初始参数估计时给出奇异梯度矩阵。为什么？

2024-01-20

这是我第一次尝试在 R 中拟合非线性模型，所以请耐心等待。

Problem

我试图理解为什么nls()给我这个错误：

Error in nlsModel(formula, mf, start, wts): singular gradient matrix at initial parameter estimates

假设

从我在 SO 的其他问题中读到的内容可能是因为：

我的模型是不连续的，或者
我的模型是过度确定的，或者
起始参数值选择错误

所以我请求帮助来解决这个错误。我可以更改型号并仍然使用吗nls()，或者我需要使用nls.lm来自minpack.lm包，正如我在其他地方读过的那样？

我的方法

以下是有关该模型的一些详细信息：

该模型是一个不连续函数，是一种楼梯函数类型（见下图）
一般来说，数量steps模型中的内容可以是可变的，但它们对于特定的拟合事件是固定的

显示问题的 MWE

MWE代码简要说明

step_fn(x, min = 0, max = 1): 返回的函数1区间内（min, max] and 0否则;抱歉这个名字，我现在意识到它并不是真正的阶跃函数......interval_fn()我想会更合适。
staircase(x, dx, dy)：的总和step_fn()功能。dx是宽度向量steps, i.e. max - min, and dy是增量y对于每个step.
staircase_formula(n = 1L)：生成一个formula表示由函数建模的模型的对象staircase()（与使用nls()功能）。
请注意，我使用purrr and glue下例中的包。

Code

step_fn <- function(x, min = 0, max = 1) {

  y <- x
  y[x > min & x <= max] <- 1
  y[x <= min] <- 0
  y[x > max] <- 0

  return(y)
}

staircase <- function(x, dx, dy) {

  max <- cumsum(dx)
  min <- c(0, max[1:(length(dx)-1)])
  step <- cumsum(dy)

  purrr::reduce(purrr::pmap(list(min, max, step), ~ ..3 * step_fn(x, min = ..1, max = ..2)), `+`)
}


staircase_formula <- function(n = 1L) {

  i <- seq_len(n)
  dx <- sprintf("dx%d", i)

  min <-
    c('0', purrr::accumulate(dx[-n], .f = ~ paste(.x, .y, sep = " + ")))
  max <- purrr::accumulate(dx, .f = ~ paste(.x, .y, sep = " + "))

  lhs <- "y"
  rhs <-
    paste(glue::glue('dy{i} * step_fn(x, min = {min}, max = {max})'),
          collapse  = " + ")

  sc_form <- as.formula(glue::glue("{lhs} ~ {rhs}")) 

  return(sc_form)
}


x <- seq(0, 10, by = 0.01)
y <- staircase(x, c(1,2,2,5), c(2,5,2,1)) + rnorm(length(x), mean = 0, sd = 0.2)

plot(x = x, y = y)
lines(x = x, y = staircase(x, dx = c(1,2,2,5), dy = c(2,5,2,1)), col="red")


my_data <- data.frame(x = x, y = y)
my_model <- staircase_formula(4)
params <- list(dx1 = 1, dx2 = 2, dx3 = 2, dx4 = 5,
               dy1 = 2, dy2 = 5, dy3 = 2, dy4 = 1)

m <- nls(formula = my_model, start = params, data = my_data)
#> Error in nlsModel(formula, mf, start, wts): singular gradient matrix at initial parameter estimates

任何帮助是极大的赞赏。

我假设你得到了一个长度为观察值的向量len正如您的示例中绘制的那样，并且您希望确定k跳跃和k跳跃尺寸。（或者也许我误解了你；但你并没有真正说出你想要实现的目标。）下面我将概述一个使用本地搜索的解决方案。我从您的示例数据开始：

x <- seq(0, 10, by = 0.01)
y <- staircase(x,
               c(1,2,2,5),
               c(2,5,2,1)) + rnorm(length(x), mean = 0, sd = 0.2)

解决方案是一个列表职位 and sizes的跳跃。请注意，我使用向量来存储这些数据，因为当你有 20 次跳跃时，定义变量会变得很麻烦。

示例（随机）解决方案：

k <- 5   ## number of jumps
len <- length(x)

sol <- list(position = sample(len, size = k),
            size = runif(k))

## $position
## [1]  89 236 859 885 730
## 
## $size
## [1] 0.2377453 0.2108495 0.3404345 0.4626004 0.6944078

我们需要一个目标函数来计算解决方案的质量。我还定义了一个简单的辅助函数stairs，由目标函数使用。目标函数abs_diff计算拟合序列之间的平均绝对差（由解定义）和y.

stairs <- function(len, position, size) {
    ans <- numeric(len)
    ans[position] <- size
    cumsum(ans)
}

abs_diff <- function(sol, y, stairs, ...) {
    yy <- stairs(length(y), sol$position, sol$size)
    sum(abs(y - yy))/length(y)
}

现在是本地搜索的关键组件：用于改进解决方案的邻域函数。邻域函数采用一个解决方案并对其进行轻微更改。在这里，它会选择一个position or a size并稍微修改一下。

neighbour <- function(sol, len, ...) {
    p <- sol$position
    s <- sol$size

    if (runif(1) > 0.5) {
        ## either move one of the positions ...
        i <- sample.int(length(p),  size = 1)
        p[i] <- p[i] + sample(-25:25, size = 1)
        p[i] <- min(max(1, p[i]), len)        
    } else {
        ## ... or change a jump size
        i <- sample.int(length(s), size = 1)
        s[i] <- s[i] + runif(1, min = -s[i], max = 1)
    }

    list(position = p, size = s)
}

调用示例：此处新解决方案的第一个跳跃大小已更改。

## > sol
## $position
## [1]  89 236 859 885 730
## 
## $size
## [1] 0.2377453 0.2108495 0.3404345 0.4626004 0.6944078
## 
## > neighbour(sol, len)
## $position
## [1]  89 236 859 885 730
## 
## $size
## [1] 0.2127044 0.2108495 0.3404345 0.4626004 0.6944078

我仍然负责本地搜索。

library("NMOF")
sol.ls <- LSopt(abs_diff,
                list(x0 = sol, nI = 50000, neighbour = neighbour),
                stairs = stairs,
                len = len,
                y = y)

我们可以绘制解决方案：拟合线显示为蓝色。

plot(x, y)
lines(x, stairs(len, sol.ls$xbest$position, sol.ls$xbest$size),
      col = "blue", type = "S")

本文内容由网友自发贡献，版权归原作者所有，本站不承担相应法律责任。如您发现有涉嫌抄袭侵权的内容，请联系:hwhale#tublm.com(使用前将#替换为@)

r

nonlinearoptimization

nls