<div dir="ltr"><div class="gmail_quote"><div dir="ltr">On Thu, Jul 12, 2018 at 11:14 AM Ali Reza Khaz'ali <<a href="mailto:arkhazali@cc.iut.ac.ir">arkhazali@cc.iut.ac.ir</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
  
    
  
  <div text="#000000" bgcolor="#FFFFFF">
    <p>Thanks very much for your answer,</p>
    <p>I am experimenting with SNES for now, however, my purpose is to
      solve a multi-physics large system with unstructured grids
      (Enhanced Oil Recovery simulation in fractured petroleum
      reservoirs to be specific, phase equilibrium and three phase flow
      equations must be solved simultaneously), where to the extend of
      my knowledge, no existing PETSc vector structure can be used.
      Therefore, I started with solving simple ODEs with SNES without
      using those structures.</p></div></blockquote><div>The intention is that you use DMPlex for this. <br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div text="#000000" bgcolor="#FFFFFF">
    <p>An interesting fact about my code samples is, with increasing the
      number of MPI workers, the number of SNES iterations (and hence,
      the precision of the solution) decrease. I'm sure there is a flaw
      in my code (I do not have much MPI experience), but I can't find
      it.<br></p></div></blockquote><div>We really do not have the resources to debug raw MPI code. That is why we have the examples.</div><div><br></div><div>  Thanks,</div><div><br></div><div>     Matt </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div text="#000000" bgcolor="#FFFFFF"><p>
    </p>
    <p>Best wishes,</p>
    <p>Ali<br>
    </p>
    <br>
    <div class="m_-655653169572082924moz-cite-prefix">On 7/12/2018 3:58 PM, Matthew Knepley
      wrote:<br>
    </div>
    <blockquote type="cite">
      <div dir="ltr">
        <div class="gmail_quote">
          <div dir="ltr">On Thu, Jul 12, 2018 at 6:30 AM Ali Reza
            Khaz'ali <<a href="mailto:arkhazali@cc.iut.ac.ir" target="_blank">arkhazali@cc.iut.ac.ir</a>>
            wrote:<br>
          </div>
          <blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Hi,<br>
            <br>
            I have to apologize if I'm asking a noob question. I am
            trying to solve <br>
            y"-(1-x/5)yy'=x with boundary condition of y(1)=2 and
            y(3)=-1 with <br>
            finite difference method. The discretized equations is to be
            solved by <br>
            SNES. I wrote the follwoing code, however, it only works if
            number of <br>
            MPI processes is set to one. for more workers, it does not
            seem to give <br>
            valid results. I'd be grateful if anybody can guide me to
            debug the <br>
            code, please.<br>
          </blockquote>
          <div><br>
          </div>
          <div>Why not just use DMDA? There are lots of examples (SNES
            ex5 for instance) and it</div>
          <div>handles the parallelism for you. This problem could be
            done in a few hours I think.</div>
          <div><br>
          </div>
          <div>  Thanks,</div>
          <div><br>
          </div>
          <div>     Matt</div>
          <div> </div>
          <blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
            Regards,<br>
            <br>
            Ali<br>
            <br>
            <br>
            <br>
            static char help[] = "Newton's method for a non-linear ODE
            system.\n\n";<br>
            double h;<br>
            int nParts;<br>
            <br>
            #include <petscsnes.h><br>
            <br>
            /*<br>
            User-defined routines<br>
            */<br>
            extern PetscErrorCode FormJacobian(SNES, Vec, Mat, Mat,
            void*);<br>
            extern PetscErrorCode FormFunction(SNES, Vec, Vec, void*);<br>
            <br>
            int main(int argc, char **argv)<br>
            {<br>
                 SNES           snes;         /* nonlinear solver
            context */<br>
                 KSP            ksp;          /* linear solver context
            */<br>
                 PC             pc;           /* preconditioner context
            */<br>
                 Vec            x, r;          /* solution, residual
            vectors */<br>
                 Mat            J;            /* Jacobian matrix */<br>
                 PetscErrorCode ierr;<br>
                 PetscMPIInt    size;<br>
                 PetscScalar    pfive = 0, *xx;<br>
                 int rank;<br>
                 int i, j;<br>
                 Vec f;<br>
            <br>
            <br>
                 nParts = 30;<br>
                 h = (3.0 - 1.0) / nParts;<br>
            <br>
                 ierr = PetscInitialize(&argc, &argv, (char*)0,
            help); if (ierr) <br>
            return ierr;<br>
                 ierr = MPI_Comm_size(PETSC_COMM_WORLD, &size);
            CHKERRQ(ierr);<br>
                 ierr = SNESCreate(PETSC_COMM_WORLD, &snes);
            CHKERRQ(ierr);<br>
                 ierr = VecCreateMPI(PETSC_COMM_WORLD, PETSC_DECIDE,
            nParts-1, &x); <br>
            CHKERRQ(ierr);<br>
                 ierr = VecDuplicate(x, &r); CHKERRQ(ierr);<br>
                 ierr = MatCreateAIJ(PETSC_COMM_WORLD, PETSC_DECIDE,
            PETSC_DECIDE, <br>
            nParts - 1, nParts - 1, 3, NULL, 3, NULL, &J);
            CHKERRQ(ierr);<br>
            <br>
            <br>
            <br>
            <br>
                 ierr = SNESSetFunction(snes, r, FormFunction, NULL);
            CHKERRQ(ierr);<br>
                 ierr = SNESSetJacobian(snes, J, J, FormJacobian, NULL);
            CHKERRQ(ierr);<br>
            <br>
            <br>
            <br>
                 ierr = SNESGetKSP(snes, &ksp); CHKERRQ(ierr);<br>
                 ierr = KSPGetPC(ksp, &pc); CHKERRQ(ierr);<br>
                 ierr = KSPSetTolerances(ksp, 1.e-8, 1e-16,
            PETSC_DEFAULT, 200); <br>
            CHKERRQ(ierr);<br>
                 ierr = SNESSetTolerances(snes, 1.e-8, 1e-16,
            PETSC_DEFAULT, 200, <br>
            200); CHKERRQ(ierr);<br>
                 ierr = SNESSetType(snes, SNESNEWTONLS); CHKERRQ(ierr);<br>
                 ierr = KSPSetType(ksp, KSPCG); CHKERRQ(ierr);<br>
            <br>
                 ierr = SNESSetFromOptions(snes); CHKERRQ(ierr);<br>
                 ierr = VecSet(x, pfive); CHKERRQ(ierr);<br>
                 ierr = SNESSolve(snes, NULL, x); CHKERRQ(ierr);<br>
                 ierr = SNESGetFunction(snes, &f, 0, 0);
            CHKERRQ(ierr);<br>
            <br>
            <br>
            <br>
                 ierr = VecDestroy(&x); CHKERRQ(ierr); ierr =
            VecDestroy(&r); <br>
            CHKERRQ(ierr);<br>
                 ierr = MatDestroy(&J); CHKERRQ(ierr); ierr =
            SNESDestroy(&snes); <br>
            CHKERRQ(ierr);<br>
                 ierr = PetscFinalize();<br>
                 return ierr;<br>
            }<br>
            <br>
            PetscErrorCode FormFunction(SNES snes, Vec x, Vec f, void
            *ctx)<br>
            {<br>
                 PetscErrorCode    ierr;<br>
                 const PetscScalar *xx;<br>
                 PetscScalar       *ff;<br>
                 PetscInt vstart, vend;<br>
                 int i, j, rank, size, xlength;<br>
                 double xp, xxstart, xxend, x1, x2;<br>
                 MPI_Status MyStat;<br>
            <br>
                 xxstart = 0;<br>
                 xxend = 0;<br>
            <br>
            <br>
                 MPI_Comm_rank(PETSC_COMM_WORLD, &rank);<br>
                 MPI_Comm_size(PETSC_COMM_WORLD, &size);<br>
            <br>
                 ierr = VecGetOwnershipRange(x, &vstart, &vend);
            CHKERRQ(ierr);<br>
                 xlength = vend - vstart;<br>
                 ierr = VecGetArrayRead(x, &xx); CHKERRQ(ierr);<br>
                 ierr = VecGetArray(f, &ff); CHKERRQ(ierr);<br>
                 ierr = VecView(x, PETSC_VIEWER_STDOUT_WORLD);<br>
                 if (size > 1) {<br>
                     if (rank == 0) {<br>
                         ierr = MPI_Send(&xx[xlength - 1], 1,
            MPI_DOUBLE, 1, 0, <br>
            PETSC_COMM_WORLD);<br>
                         ierr = MPI_Recv(&xxend, 1, MPI_DOUBLE, 1,
            1, <br>
            PETSC_COMM_WORLD, &MyStat);<br>
                     }<br>
                     else if (rank == (size - 1)) {<br>
                         ierr = MPI_Recv(&xxstart, 1, MPI_DOUBLE,
            rank - 1, rank - <br>
            1, PETSC_COMM_WORLD, &MyStat);<br>
                         ierr = MPI_Send(&xx[0], 1, MPI_DOUBLE, rank
            - 1, rank, <br>
            PETSC_COMM_WORLD);<br>
                     }<br>
                     else {<br>
                         ierr = MPI_Send(&xx[xlength - 1], 1,
            MPI_DOUBLE, rank + 1, <br>
            rank, PETSC_COMM_WORLD);<br>
                         ierr = MPI_Recv(&xxend, 1, MPI_DOUBLE, rank
            + 1, rank + 1, <br>
            PETSC_COMM_WORLD, &MyStat);<br>
                         ierr = MPI_Send(&xx[0], 1, MPI_DOUBLE, rank
            - 1, rank, <br>
            PETSC_COMM_WORLD);<br>
                         ierr = MPI_Recv(&xxstart, 1, MPI_DOUBLE,
            rank - 1, rank - <br>
            1, PETSC_COMM_WORLD, &MyStat);<br>
                     }<br>
                 }<br>
            <br>
            <br>
            <br>
            <br>
                 for (i = vstart; i < vend; i++) {<br>
                     xp = 1 + i * h;<br>
                     j = i - vstart;<br>
            <br>
            <br>
                     if (i == vstart) x1 = xxstart;<br>
                     else x1 = xx[j - 1];<br>
                     if (i == (vend - 1)) x2 = xxend;<br>
                     else x2 = xx[j + 1];<br>
            <br>
            <br>
                     if (i==0) ff[j] = (x2 - 2 * xx[j] + 2) / (h*h) - (1
            - xp / <br>
            5)*xx[j] * (x2 - 2) / (2 * h) - xp;<br>
                     else if (i==(nParts-2)) ff[j]=(-1 - 2 * xx[j] + x1)
            / (h*h) - <br>
            (1 - xp / 5)*xx[j] * (-1 - x1) / (2 * h) - xp;<br>
                     else ff[j] = (x2 - 2 * xx[j] + x1) / (h*h) - (1 -
            xp / 5)*xx[j] <br>
            * (x2 - x1) / (2 * h) - xp;<br>
                 }<br>
            <br>
                 /* Restore vectors */<br>
                 ierr = VecRestoreArrayRead(x, &xx); CHKERRQ(ierr);<br>
                 ierr = VecRestoreArray(f, &ff); CHKERRQ(ierr);<br>
                 return 0;<br>
            }<br>
            <br>
            PetscErrorCode FormJacobian(SNES snes, Vec x, Mat jac, Mat
            B, void *dummy)<br>
            {<br>
                 const PetscScalar *xx;<br>
                 PetscErrorCode    ierr;<br>
            <br>
                 PetscInt vstart, vend;<br>
                 double rrow[3];<br>
                 int col[3];<br>
                 int i, j, rank, size, xlength;<br>
                 double xp, xxstart, xxend, x1, x2;<br>
                 MPI_Status MyStat;<br>
            <br>
                 ierr = MatZeroEntries(jac); CHKERRQ(ierr);<br>
            <br>
                 ierr = VecGetOwnershipRange(x, &vstart, &vend);
            CHKERRQ(ierr);<br>
                 xlength = vend - vstart;<br>
                 MPI_Comm_rank(PETSC_COMM_WORLD, &rank);<br>
                 MPI_Comm_size(PETSC_COMM_WORLD, &size);<br>
            <br>
                 xxstart = 0;<br>
                 xxend = 0;<br>
            <br>
                 /*<br>
                 Get pointer to vector data.0+1<br>
                 */<br>
                 ierr = VecGetArrayRead(x, &xx); CHKERRQ(ierr);<br>
                 if (size > 1) {<br>
                     if (rank == 0) {<br>
                         ierr = MPI_Send(&xx[xlength - 1], 1,
            MPI_DOUBLE, 1, 0, <br>
            PETSC_COMM_WORLD);<br>
                         ierr = MPI_Recv(&xxend, 1, MPI_DOUBLE, 1,
            1, <br>
            PETSC_COMM_WORLD, &MyStat);<br>
                         xxstart = 1e12;<br>
                     }<br>
                     else if (rank == (size - 1)) {<br>
                         ierr = MPI_Recv(&xxstart, 1, MPI_DOUBLE,
            rank - 1, rank - <br>
            1, PETSC_COMM_WORLD, &MyStat);<br>
                         ierr = MPI_Send(&xx[0], 1, MPI_DOUBLE, rank
            - 1, rank, <br>
            PETSC_COMM_WORLD);<br>
                         xxend = 1e12;<br>
                     }<br>
                     else {<br>
                         ierr = MPI_Send(&xx[xlength - 1], 1,
            MPI_DOUBLE, rank + 1, <br>
            rank, PETSC_COMM_WORLD);<br>
                         ierr = MPI_Recv(&xxend, 1, MPI_DOUBLE, rank
            + 1, rank + 1, <br>
            PETSC_COMM_WORLD, &MyStat);<br>
                         ierr = MPI_Send(&xx[0], 1, MPI_DOUBLE, rank
            - 1, rank, <br>
            PETSC_COMM_WORLD);<br>
                         ierr = MPI_Recv(&xxstart, 1, MPI_DOUBLE,
            rank - 1, rank - <br>
            1, PETSC_COMM_WORLD, &MyStat);<br>
                     }<br>
                 }<br>
            <br>
            <br>
                 for (i = vstart; i < vend; i++) {<br>
                     xp = 1 + i * h;<br>
                     j = i - vstart;<br>
            <br>
                     if (i == vstart) x1 = xxstart;<br>
                     else x1 = xx[j - 1];<br>
                     if (i == (vend - 1)) x2 = xxend;<br>
                     else x2 = xx[j + 1];<br>
            <br>
                     if (i == 0) {<br>
                         rrow[0] = -2 / (h*h) - (1 - xp / 5)*(x2 - 2) /
            (2 * h);<br>
                         rrow[1] = 1 / (h*h) - (1 + xp / 5)*xx[j] / (2 *
            h);<br>
                         col[0] = i;<br>
                         col[1] = i + 1;<br>
                         ierr = MatSetValues(jac, 1, &i, 2, col,
            rrow, <br>
            INSERT_VALUES); CHKERRQ(ierr);<br>
                     }<br>
                     else if (i == (nParts-2)) {<br>
                         rrow[0] = 1 / (h*h) + (1 + xp / 5)*xx[j] / (2 *
            h);<br>
                         rrow[1] = -2 / (h*h) - (1 - xp / 5)*(-1 - x1) /
            (2 * h);<br>
                         col[0] = i - 1;<br>
                         col[1] = i;<br>
                         ierr = MatSetValues(jac, 1, &i, 2, col,
            rrow, <br>
            INSERT_VALUES); CHKERRQ(ierr);<br>
                     }<br>
                     else {<br>
                         rrow[0] = 1 / (h*h) + (1 + xp / 5)*xx[j] / (2 *
            h);<br>
                         rrow[1] = -2 / (h*h) - (1 - xp / 5)*(x2 - x1) /
            (2 * h);<br>
                         rrow[2] = 1 / (h*h) - (1 + xp / 5)*xx[j] / (2 *
            h);<br>
                         col[0] = i-1;<br>
                         col[1] = i;<br>
                         col[2] = i + 1;<br>
                         ierr = MatSetValues(jac, 1, &i, 3, col,
            rrow, <br>
            INSERT_VALUES); CHKERRQ(ierr);<br>
                     }<br>
                 }<br>
            <br>
            <br>
            <br>
                 /*<br>
                 Restore vector<br>
                 */<br>
                 ierr = VecRestoreArrayRead(x, &xx); CHKERRQ(ierr);<br>
            <br>
                 /*<br>
                 Assemble matrix<br>
                 */<br>
                 B = jac;<br>
                 ierr = MatAssemblyBegin(B, MAT_FINAL_ASSEMBLY);
            CHKERRQ(ierr);<br>
                 ierr = MatAssemblyEnd(B, MAT_FINAL_ASSEMBLY);
            CHKERRQ(ierr);<br>
                 ierr = MatAssemblyBegin(jac, MAT_FINAL_ASSEMBLY);
            CHKERRQ(ierr);<br>
                 ierr = MatAssemblyEnd(jac, MAT_FINAL_ASSEMBLY);
            CHKERRQ(ierr);<br>
            <br>
                 return 0;<br>
            }<br>
            <br>
          </blockquote>
        </div>
        <br clear="all">
        <div><br>
        </div>
        -- <br>
        <div dir="ltr" class="m_-655653169572082924gmail_signature" data-smartmail="gmail_signature">
          <div dir="ltr">
            <div>
              <div dir="ltr">
                <div>What most experimenters take for granted before
                  they begin their experiments is infinitely more
                  interesting than any results to which their
                  experiments lead.<br>
                  -- Norbert Wiener</div>
                <div><br>
                </div>
                <div><a href="http://www.caam.rice.edu/%7Emk51/" target="_blank">https://www.cse.buffalo.edu/~knepley/</a><br>
                </div>
              </div>
            </div>
          </div>
        </div>
      </div>
    </blockquote>
    <br>
  </div>

</blockquote></div><br clear="all"><div><br></div>-- <br><div dir="ltr" class="gmail_signature" data-smartmail="gmail_signature"><div dir="ltr"><div><div dir="ltr"><div>What most experimenters take for granted before they begin their experiments is infinitely more interesting than any results to which their experiments lead.<br>-- Norbert Wiener</div><div><br></div><div><a href="http://www.caam.rice.edu/~mk51/" target="_blank">https://www.cse.buffalo.edu/~knepley/</a><br></div></div></div></div></div></div>