<div dir="ltr"><div><div><div><div><div><div><div><div><div><div><div><div><div><div><div><div>Dear all,<br></div><div>(I apologize for a long email!! I think it's better to clearly explain the problem even if it is boring than be short and confusing).<br>
<br></div>I need to solve following kind of set of equations in 3D for the domain of around 250 ^ 3 grid size:<br></div>div (K(grad(v)) - grad (p) = f1 (momentum eqn)<br></div>div (v) = f2 (continuity eqn)<br>
<br></div><div>v a vector of 3 components, p a scalar.<br></div>K is piecewise discontinuous. Say it has two values K1, K2 with ratio K1/K2 reaching up to 10^5.<br></div>f1, f2 are not zero and are functions of the space variable (x,y,z). <br>
</div>This is thus pretty much a standard stokes flow equation (except for the non zero f2).<br><br></div>Now, I implemented two different approaches, each for both 2D and 3D, in MATLAB. It works for the smaller sizes but I have problems solving it for the problem size I need (250^3 grid size).<br>
</div>I use staggered grid with p on cell centers, and components of v on cell faces. Similar split up of K to cell center and faces to account for the variable viscosity case)<br></div>The two approaches I take are:<br><br>
M1) Setting up Ax=b where x stacks up the components of v and the scalar p for all discretized points. Thus A has one of the block matrices with zero diagonal. Then use iterative solvers available. This requires the preconditioner and that's where I had problems. I could not find a suitable precondtioner and iterative solver pair in MATLAB that could produce result for this size of the saddle point problem I have.<br>
<br></div>M2) Using Augmented Lagrangian / Penalty method: "Uncouple" the pressure and velocity; i.e. use a pseudotime step to iteratively update p starting from the initial guess. After each iteration use the new p value to solve for the momentum equation. In this case the Ax=b would have A with no zeros in the diagonal, and we are solving for x which has only the components of v. Here preconditioner such as ILU worked with gmres etc. But for the bigger problem size (the size I need !!) it does not scale up.<br>
<br></div>Now, this is when I started learning petsc and after having played around a bit with it, I've a preliminary implementation of 2D problem with method M1. What I did was basically use DMDA and KSP. To adapt DMDA for staggered grid, I simply used ghost-values as explained in fig 7.5, page 96 of: <a href="http://books.google.co.uk/books?id=W83gxp165SkC&printsec=frontcover&dq=Introduction+to+Numerical+Geodynamic+Modelling&hl=en&sa=X&ei=v6TmUaP_L4PuOs3agJgE&ved=0CDIQ6AEwAA">http://books.google.co.uk/books?id=W83gxp165SkC&printsec=frontcover&dq=Introduction+to+Numerical+Geodynamic+Modelling&hl=en&sa=X&ei=v6TmUaP_L4PuOs3agJgE&ved=0CDIQ6AEwAA</a><br>
</div><div>That means for 2D case, I have a DMDA with 3 dof and set KSP with this DM object and solve the system.<br><br></div>Now my actual question (you are patient :) )<br></div>1) From 2D example, I can see that in 3D the issue again will be to have a proper preconditioner. I saw some discussions in the archive about using PCFieldsplit or schur complement. Before implementing the 3D case I'd like to get some suggestions if I could directly use some combination of solver and preconditioner to get result for my problem size in 3D. Can I use multigrid with proper preconditioner in this case ? And by the way is there a possibility of having the support in petsc for this implementation: (Matt from petsc-dev is one of the authors of the paper, so I thought I could as it here) <br>
<a href="http://hpc.sagepub.com/content/early/2013/03/03/1094342013478640.abstract">http://hpc.sagepub.com/content/early/2013/03/03/1094342013478640.abstract</a><br><br></div>2) The other option is to use M2 (see above), which worked in Matlab for smaller 3D sizes. But the tricky part there would be to tune the pseudotime parameter based on different viscocity values. Furthermore, I'm not sure if this will scale up in parallel to give me the results.<br>
</div>I will have to try, but it'd be nice if you can suggest which of the two methods should I try first ?<br></div><div>My interest in the first method was due to the possibility having an implementation in Petsc of using multigrid with GPU (if that speeds up the whole thing).<br>
<br></div><div>Thanks,<br></div><div>Bishesh<br></div><br><br></div>