From owner-chemistry@ccl.net Wed Oct 19 04:08:01 2011 From: "Konrad Hinsen hinsen^^cnrs-orleans.fr" To: CCL Subject: CCL: Science code manifesto - just publish the code Message-Id: <-45690-111019031617-22435-fHCrGHo0rsawQxH2erEMuQ^-^server.ccl.net> X-Original-From: Konrad Hinsen Content-Transfer-Encoding: 8bit Content-Type: text/plain; charset=ISO-8859-1; format=flowed; delsp=yes Date: Wed, 19 Oct 2011 09:16:02 +0200 Mime-Version: 1.0 (Apple Message framework v936) Sent to CCL by: Konrad Hinsen [hinsen**cnrs-orleans.fr] On 18 oct. 11, at 20:32, Vincent Leroux vincent.leroux- -loria.fr wrote: > Wherever access to the code is necessary to reproduce results, it > *must* be provided one way or another (e.g. as supporting > information) if it is not freely available already, this is just > plain obvious to me. > > Wondering why a manifesto would be needed in order to do just that... It is needed because "just that" does not happen today. I get a lot of computational science papers to review, but none of them had any code attached (not even small analysis scripts), nor any explanation of how I could obtain it. Even for my own articles, I rarely provide scripts because most journals don't want them. One day I'll put all that on my Web site. As a reviewer, I insist that authors provide a full explanation of all algorithms they use (usually through literature references of course), and that they state which versions of which programs were used. I often ask them to consider making code and data available, but that has happened only in one case. I would like to be able to request such information more firmly, but I don't see much of a basis for that at the moment: it's neither a general habit nor something encouraged by the author guidelines of the major scientific journals. -- --------------------------------------------------------------------- Konrad Hinsen Centre de Biophysique Molculaire, CNRS Orlans Synchrotron Soleil - Division Expriences Saint Aubin - BP 48 91192 Gif sur Yvette Cedex, France Tel. +33-1 69 35 97 15 E-Mail: research at khinsen dot fastmail dot net http://dirac.cnrs-orleans.fr/~hinsen/ --------------------------------------------------------------------- From owner-chemistry@ccl.net Wed Oct 19 08:19:01 2011 From: "Kalju Kahn kalju[A]chem.ucsb.edu" To: CCL Subject: CCL:G: anharmonic vibrational frequencies and IR intensities Message-Id: <-45691-111019003841-23674-CIuap1p4+2HjH+sF2M3uKA(-)server.ccl.net> X-Original-From: "Kalju Kahn" Content-Transfer-Encoding: 8bit Content-Type: text/plain;charset=iso-8859-1 Date: Tue, 18 Oct 2011 21:38:32 -0700 MIME-Version: 1.0 Sent to CCL by: "Kalju Kahn" [kalju!A!chem.ucsb.edu] Dear Steve, The answer partially depends on the QM model; oftentimes anharmonic frequencies are provided for methods for which analytical second derivatives are available. For example, Gaussian supports anharmonic fundamentals at HF, DFT, and MP2 level ... but not at the CCSD level. I believe that most program that allow for anharmonic fundamental frequencies should be able to produce overtone and combination frequencies as well. It may be not on by default; in MOLPRO, set COMBI=1; in CFOUR, request ANHARM=FULLQUARTIC. NWChem's VSCF module can be used to calculate anharmonic frequencies at any level of theory using numeric grid, but dipole derivatives are available only at the SCF level. A standalone program SPECTRO allows calculation of anharmonic infrared spectra from energy and dipole moment derivatives. People sometimes write their own codes to calculate anharmonic frequencies and intensities based on ab initio energies. For example, JCP recently published our convergence study on anharmonic IR intensities in hydrogen fluoride (J. Chem. Phys. 135, 131103 (2011)). The Mathematica code that evaluates fundamental and overtone intensities from the anharmonic force field and dipole function is available upon request ... just contact me directly. Hope this helps, Kalju > Sent to CCL by: Steve Williams [willsd]^[APPSTATE.EDU] > I am interested in methods for ab initio computation of IR spectra and > am curious about codes that can perform some or all of the steps below. > I know about scale factors for frequencies and force constants to > improve harmonic frequencies, and this is not what I'm interested in > here. I'd like to find out about what ab initio codes can: > 1) Compute anharmonic frequencies for fundamental modes, > 2) Compute anharmonic frequencies for overtones and combination bands, > 3) Compute anharmonic IR intensities for fundamental modes, > and 4) Compute anharmonic IR intensities for overtones and combination > bands. > > I know the following: Gaussian and Qchem can do 1 and 2, but not the > intensities in 3 and 4. > GAMESS-US and Cfour can do 1,2,3, and 4. > Molpro can do 1 and possibly 2, but I'm not sure about 3 or 4 (I don't > use molpro and I did find in the online manual explict statements about > overtones, combination bands, and IR intensities.) > > I'd appreciate any information about any ab initio codes with > capabilities in any of these areas. > > Thanks, > Steve Williams> > > ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Dr. Kalju Kahn Department of Chemistry and Biochemistry UC Santa Barbara, CA 93106 From owner-chemistry@ccl.net Wed Oct 19 08:54:01 2011 From: "Laurence Cuffe cuffe(~)mac.com" To: CCL Subject: CCL:G: Science code manifesto Message-Id: <-45692-111019041206-23975-wlC+xf0E2JVOpE7UF0Tl4g]-[server.ccl.net> X-Original-From: Laurence Cuffe Content-type: multipart/alternative; boundary="Boundary_(ID_g/oUSGw8BdMBhZ26s5JmZg)" Date: Wed, 19 Oct 2011 08:11:19 +0000 (GMT) MIME-version: 1.0 Sent to CCL by: Laurence Cuffe [cuffe^mac.com] --Boundary_(ID_g/oUSGw8BdMBhZ26s5JmZg) Content-type: text/plain; charset=utf-8; format=flowed Content-transfer-encoding: quoted-printable On Oct 17, 2011, at 08:15 AM, "Jo=C3=A3o Brand=C3=A3o jbrandao+/-ualg.pt" = wrote: > Sorry, but I disagree. > > "but if there are real concerns about the work, it is necessary that the= scientific > community can look seriously at the code to see exactly what the program= is doing." > > In my opinion: > If anyone has real concerns about any work, the best way for science dev= elopment is to write (or use) a different code and compare the results. > > Jo=C3=A3o Brand=C3=A3o =20 It has happened previously that the same calculation run on two different = software packages has resulted in different results. This can happen when = different teams interpret research methods differently. If we can examine= the code, then we can see what has been done. If we cannot examine the c= ode, we have no basis other than reputation on which to decide which is co= rrect. Basing decisions on reputation only,=EF=BB=BF makes for poor science. =20 All the best Laurence Cuffe. > > > > > Em 16-10-2011 22:33, Brian Salter-Duke brian.james.duke.:.gmail.com escr= eveu: >> Sent to CCL by: Brian Salter-Duke [brian.james.duke(~)gmail.com] >> The review of manuscripts by referees prior to publication is just a >> small part of what this is all about. The main point is about proper >> review by the scientific community after publication. There, while not >> everyone has access to a 600MHz NMR or has paid to get Gaussian, some >> people in the community will have. The guys who paid for Gaussian >> however, may not have bought the code and in some cases, it is >> impossible to buy the code, as they are commercial secrets. This >> manifesto is saying that commercial secrets are not compatible with >> good science. >> >> Of course the need to this kind of review is not common, but if there >> are real concerns about the work, it is necessary that the scientific >> community can look seriously at the code to see exactly what the >> program is doing. >> >> Brian. >> >> On Sun, Oct 16, 2011 at 8:05 PM, Sebastian Kozuch kozuchs^_^yahoo.com >> wrote: >> =20 >>> I like the idea of open software, and in principle I support the propo= sal of >>> having access to the software code for the possibility of full review = of a >>> theoretical work. However, I feel that this is more wishful thinking t= han >>> real scientific life. >>> >>> Let=E2=80=99s say that I receive a manuscript for review that uses pro= gram X, which >>> I may have access to its code now (since I=E2=80=99m the reviewer). Do= I have to >>> spend a week trying to understand its algorithms to check if the frequ= encies >>> are correctly calculated? For me, most (if not all) the programs are i= n >>> practice black boxes, as I=E2=80=99m hardly a programmer. Therefore, o= pen or closed >>> software (usually) doesn=E2=80=99t make me any difference, except from= a >>> philosophical perspective. >>> >>> Now, let me consider an analogy from experimental chemistry. I have to >>> review a manuscript where the authors tested some compound with a 600 = MHz >>> NMR. My lab is much more humble, and I only have access to a 400 MHz N= MR. Id >>> est, I cannot reproduce the results of the paper that I=E2=80=99m revi= ewing. Is here >>> any difference compared to the theoretical case? >>> >>> I had in a couple of cases problems to reproduce the results of papers= , >>> mostly because the authors didn=E2=80=99t provide enough information. = However, for >>> 100% reproducibility I need 100% the same conditions (and a lot of res= ources >>> and time). This makes the selection of the software just a small issue= --Boundary_(ID_g/oUSGw8BdMBhZ26s5JmZg)-- From owner-chemistry@ccl.net Wed Oct 19 09:29:01 2011 From: "Andrew Dalke dalke a dalkescientific.com" To: CCL Subject: CCL: Science code manifesto Message-Id: <-45693-111019053855-16033-l/GvznfEo2jKzjpeqd4OjQ:-:server.ccl.net> X-Original-From: Andrew Dalke Content-Transfer-Encoding: 8bit Content-Type: text/plain; charset=iso-8859-1 Date: Wed, 19 Oct 2011 11:38:48 +0200 Mime-Version: 1.0 (Apple Message framework v1084) Sent to CCL by: Andrew Dalke [dalke^^^dalkescientific.com] On Oct 18, 2011, at 4:17 PM, Adri Cereto Massagu adrian.cereto,+,gmail.com wrote: > I don't think the manifesto is at odds with FSF. GPL'd software can be sold at any price, but its source code must be available for those who own the software at no further cost. And someone who has bought some GPL software is allowed to redistribute it for free, so researchers using it for a paper would be able to provide the software to reviewers and readers of the paper at no cost. Abstract: How much can the paper authors ask for access to the source code? How much can the curators charge? What should the curator do if the curated software contains a license violation? If I write a paper which depends on software for its analysis, and others should have access to the software as part of effective peer review, then how much can I charge others to get access to the software? US $1 billion? The FSF says I can charge as much as you want, and that freedom is one of the core freedoms of free software. The philosophy that others need access to my source code to provide good peer review has the implicit assumption that I will provide the software at a non-prohibitive cost. There is clearly a tension between these two viewpoints. This manifesto says nothing of what that cost might be, nor even that it might be an issue. What should be the cost to get access to source code from the author, or from the curator? Does the curator get no-cost access to it as a condition of publication? Doesn't any limit on cost curtail what the FSF says is my freedom to charge as much as I want? Remember, the FSF encourages software freedom. I argue that scientific communication has overlapping but different goals. Science communications needs to have a low cost so that many people can get access to it. The FSF is only concerned about what happens *after* someone gets access to software. This is of course similar to (most) scientific papers. There the author gives the curator the right to redistribute the paper without paying royalties, and the curator can charge effectively any price for it. Most paper publishers want to maximize revenue, and therefore set high but not prohibitive prices. The software author may have other concerns. Interestingly, the software curator takes on a more difficult challenge than a paper curator. The authors of a paper (with a few exceptions usually well-covered by fair use exceptions) are the only copyright holders of a paper. More often though, the accompanying software has many more copyright holders. That can lead to problems. Consider the CDK chemistry toolkit. The package contains many copyright holders, including those from third-party libraries which it incorporates. A few years ago the CDK was in minor violation of the LGPL requirement of some of those libraries. (It omitted the credit required by those licenses.) This was quickly fixed once pointed out. I can easily imagine cases where it can't be easily fixed. The curator takes on the risk that someone else, who is a copyright holder to the software in question but not a paper author, may challenge the right of the curator to distribute the software. How does the curator resolve the violation, especially if the original author doesn't want to be involved? Does the curator remove the software in question? If so, and if you insist that the software must be available in order to do correct peer review, then should the corresponding paper also be withdrawn? As I said before, these are solvable. I bring them up because I encourage people to distribute their source code along with the paper, and to be aware that it's not a simple, clear issue. Andrew dalke[*]dalkescientific.com From owner-chemistry@ccl.net Wed Oct 19 11:30:00 2011 From: "George Fitzgerald George.Fitzgerald/./accelrys.com" To: CCL Subject: CCL:G: Science code manifesto Message-Id: <-45694-111019105032-13582-A56iytuT9Uxzz2A9XmVkjA]^[server.ccl.net> X-Original-From: George Fitzgerald Content-Language: en-US Content-Transfer-Encoding: 8bit Content-Type: text/plain; charset="utf-8" Date: Wed, 19 Oct 2011 07:50:08 -0700 MIME-Version: 1.0 Sent to CCL by: George Fitzgerald [George.Fitzgerald[A]accelrys.com] As a member of a company that makes money from selling software, I probably have a different outlook on this than most CCLers. But I have one very practical question: as a reviewer, do you really have the time and expertise to review 1000s of lines of source code? I find that properly reviewing a paper already takes several hours. From experience I know that reviewing somebody's source code can take days. Can anybody give me an example of what you'd even look for in the source code? I'm thinking back to, for example, Peter Gill's 'PRISM' method for Gaussian integration, or Benny Johnson and DFT analytic 2nd derivatives. Are you claiming that those papers shouldn’t have been published without the reviewer reviewing the code? -george From owner-chemistry@ccl.net Wed Oct 19 13:01:00 2011 From: "Berger Raphael berger]![chem.helsinki.fi" To: CCL Subject: CCL: Science code manifesto Message-Id: <-45695-111019123615-3729-FlOevUwrK3zxa4R+Ew/+vw : server.ccl.net> X-Original-From: Berger Raphael Content-Type: TEXT/PLAIN; format=flowed; charset=US-ASCII Date: Wed, 19 Oct 2011 19:35:48 +0300 (EEST) MIME-Version: 1.0 Sent to CCL by: Berger Raphael [berger[a]chem.helsinki.fi] Hello CCL-ers! in my opinion the results of basic research, including computer programms, must be accessible freely to every human beeing who whishes so. In the first instance science is founded by the whole community in a state or most of all via taxes, but also in an abstract manner since sicience and cultre are properties of the whole society. Not speaking about moral or philosphical principles. Last but not least it turns out, that commercial QC programms grow old and get inflexible much faster than free codes. Of course its a pity, that able scientists who work hard to contribute to high-level computer programms cannot earn their money more easily. best regards From owner-chemistry@ccl.net Wed Oct 19 13:36:00 2011 From: "Berger Raphael berger/a\chem.helsinki.fi" To: CCL Subject: CCL:G: Science code manifesto Message-Id: <-45696-111019122525-12544-Ts7fJkuPpuvMN/5OmZw3qw(a)server.ccl.net> X-Original-From: Berger Raphael Content-Type: MULTIPART/MIXED; BOUNDARY="-696203250-1301958500-1319041514=:1321" Date: Wed, 19 Oct 2011 19:25:14 +0300 (EEST) MIME-Version: 1.0 Sent to CCL by: Berger Raphael [berger]^[chem.helsinki.fi] This message is in MIME format. The first part should be readable text, while the remaining parts are likely unreadable without MIME-aware tools. ---696203250-1301958500-1319041514=:1321 Content-Type: TEXT/PLAIN; charset=utf-8; format=flowed Content-Transfer-Encoding: 8BIT Hello CCL-ers! in my opinion the results of basic research, including computer programms, must be accessible freely to every human beeing who whishes so. In the first instance science is founded by the whole community in a state or most of all via taxes, but also in an abstract manner since sicience and cultre are properties of the whole society. Not speaking about moral or philosphical principles. Last but not least it turns out, that commercial QC programms grow old and get inflexible much faster than free codes. Of course its a pity, that able scientists who work hard to contribute to high-level computer programms cannot earn their money more easily. best regards On Wed, 19 Oct 2011, George Fitzgerald George.Fitzgerald/./accelrys.com wrote: > > Sent to CCL by: George Fitzgerald [George.Fitzgerald[A]accelrys.com] > As a member of a company that makes money from selling software, I probably have a different outlook on this than most CCLers. But I have one very practical question: as a reviewer, do you really have the time and expertise to review 1000s of lines of source code? I find that properly reviewing a paper already takes several hours. From experience I know that reviewing somebody's source code can take days. > > Can anybody give me an example of what you'd even look for in the source code? I'm thinking back to, for example, Peter Gill's 'PRISM' method for Gaussian integration, or Benny Johnson and DFT analytic 2nd derivatives. Are you claiming that those papers shouldn’t have been published without the reviewer reviewing the code? > > -george> > > ---696203250-1301958500-1319041514=:1321-- From owner-chemistry@ccl.net Wed Oct 19 14:10:00 2011 From: "Jaroslav Kalinovski kofeinu:+:hotmail.com" To: CCL Subject: CCL:G: Science code manifesto Message-Id: <-45697-111019125949-25130-zWXnxciit+WExMXENSovUw(_)server.ccl.net> X-Original-From: Jaroslav Kalinovski Content-Transfer-Encoding: 8bit Content-Type: text/plain; charset="Windows-1252" Date: Wed, 19 Oct 2011 19:59:39 +0300 MIME-Version: 1.0 Sent to CCL by: Jaroslav Kalinovski [kofeinu-$-hotmail.com] Hi, I really do not understand... for me, the source code I am writing is like an equipment in the laboratory. Do you really need someone else equipment to make a proper review? Maybe in this case it is easy to send the code but rules are the same. Do you ask for particle accelerator when reviewing paper about experiment with one? If someone is having troubles with reproducing my results, one always can write to me and simply ask for the code but I do not feel I HAVE TO publish code explicitly. In the end it is my property, I can describe algorithm, points of theory but why should I give the code? No one is watching at the hands of experimentalists while reviewing their papers. I think people are forgetting that code is just a tool not a research result. It is true that relying only on reputation is not the perfect way but it works for other disciplines. In the end we always have to put on some trust in authors. That is all for me. Best regards, J.Kalinowski --- Laboratory of Physical Chemistry University of Helsinki --- Logic is a systematic method of coming to the wrong conclusion with confidence. ---------------------------------------- > From: owner-chemistry ~ ccl.net > To: kofeinu ~ hotmail.com > Subject: CCL:G: Science code manifesto > Date: Wed, 19 Oct 2011 07:50:08 -0700 > > > Sent to CCL by: George Fitzgerald [George.Fitzgerald[A]accelrys.com] > As a member of a company that makes money from selling software, I probably have a different outlook on this than most CCLers. But I have one very practical question: as a reviewer, do you really have the time and expertise to review 1000s of lines of source code? I find that properly reviewing a paper already takes several hours. From experience I know that reviewing somebody's source code can take days. > > Can anybody give me an example of what you'd even look for in the source code? I'm thinking back to, for example, Peter Gill's 'PRISM' method for Gaussian integration, or Benny Johnson and DFT analytic 2nd derivatives. Are you claiming that those papers shouldnt have been published without the reviewer reviewing the code? > > -george> > From owner-chemistry@ccl.net Wed Oct 19 14:45:00 2011 From: "Goedele Roos groos]=[vub.ac.be" To: CCL Subject: CCL: Poisson-Boltzmann pka calculations Message-Id: <-45698-111019131523-8087-1J0ATu9zCQCMGWp+EdaQ+g^^^server.ccl.net> X-Original-From: Goedele Roos Date: Wed, 19 Oct 2011 19:15:15 +0200 Sent to CCL by: Goedele Roos [groos|-|vub.ac.be] Dear all, Has someone experience with poisson-Boltzmann pka calculations? I noticed there are several PB solvers downloadable like H++ and Adaptive Poisson-Boltzmann Solver (APBS)...which ones are good to use? How well does PB calculations perform for cysteine pka's in proteins? Thanks in advance for help, Goedele _____________________________ Goedele Roos, PhD Department of General Chemistry http://we.vub.ac.be/~algc/ VIB, Department of Structural Biology, Vrije Universiteit Brussel | http://www.structuralbiology.be/ Brussels Center for Redox Biology | http://redox.vub.ac.be/ Vrije Universiteit Brussel, Building G, room 10G714 Pleinlaan 2, 1050 Brussels,Belgium phone: +32 2 6293312 fax: +32 2 6293317 E-mail: groos##vub.ac.be From owner-chemistry@ccl.net Wed Oct 19 15:20:00 2011 From: "Mihaly Mezei Mihaly.Mezei ~~ mssm.edu" To: CCL Subject: CCL: Science code manifesto Message-Id: <-45699-111019125641-24024-GLcPSipGKx7OypmaUPsMYQ]*[server.ccl.net> X-Original-From: Mihaly Mezei Content-disposition: inline Content-language: en Content-transfer-encoding: 7BIT Content-type: text/plain; charset=us-ascii Date: Wed, 19 Oct 2011 12:56:18 -0400 MIME-version: 1.0 Sent to CCL by: Mihaly Mezei [Mihaly.Mezei_+_mssm.edu] While I am putting most of the software (source code included) used in in my papers on my website, I think REQUESTING the source code to be published/made available is too strong a requirement. This would be equivalent to requesting the circuit-board diagrams of the various instruments and/or the source code of the data acquisition programs used in experimental work. What IS important is the specification of the algorithms and the parameter set used. Examining the source code (assuming that it is feasible to pore through the thousands of lines of code) is only warranted if either fraud or error is suspected. Suspecting fraud without specific reason is just plain wrong. The same way that we assume that the graphs in an experimental paper properly represent the raw data, we should trust that the code represents the authors honest effort to implement the algorithms in case. If there is solid reason to suspect fraud then there are various avenues to pursue such investigation and that would certainly involve requesting the code. As I said before, the default assumption should be trust. However, what COULD be requested as part of the protocol is the description of the various tests the author performed to demonstrate that the code indeed implements the algorithm described. Suspicion of error can come form two sources: (a) something 'does not make sense' and (b) I repeated the calculation with an other code and got different results. In the first case, if you are a reviewer, you can ask the author for clarification or, if you read the paper after publication, ask the author and if the response is unsatisfactory, write a letter to the editor commenting on the paper. One of these may lead to an acceptable explanation or the author discovering an error in a code. In the second case, the the two groups involved can work together in determining the source of the discrepancy - this is much more efficient than one grou examining the other group's code. Such cooperative error searc can be done without examining the source code of the other group by comparing partial results - that could lead to pinpointing the source of the discrepancy (I speak from experience here). Mihaly Mezei Department of Structural and Chemical Biology, Mount Sinai School of Medicine Voice: (212) 659-5475 Fax: (212) 849-2456 WWW (MSSM home): http://www.mountsinai.org/Find%20A%20Faculty/profile.do?id=0000072500001497192632 WWW (Lab home - software, publications): http://inka.mssm.edu/~mezei WWW (Department): http://atlas.physbio.mssm.edu From owner-chemistry@ccl.net Wed Oct 19 15:55:00 2011 From: "Sebastian Kozuch kozuchs:yahoo.com" To: CCL Subject: CCL: Science code manifesto Message-Id: <-45700-111019134817-30070-YOGCPwcHsS9A2zCWaCs99w(0)server.ccl.net> X-Original-From: Sebastian Kozuch Content-Type: multipart/alternative; boundary="1975645123-57344941-1319046488=:47733" Date: Wed, 19 Oct 2011 10:48:08 -0700 (PDT) MIME-Version: 1.0 Sent to CCL by: Sebastian Kozuch [kozuchs ~~ yahoo.com] --1975645123-57344941-1319046488=:47733 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable It is clear that the issue of the Science Code Manifesto=0Astirred a lot of= feelings. I must say that I found this topic almost as=0Aenjoyable as the = match between Fortran and C++. I learnt a couple of small=0Athings with thi= s discussion, but all in all I still subscribe to the same ideas=0AI wrote = some days ago:=0A=C2=A0=0A=E2=80=9CI like the idea of open software, and in= principle I=0Asupport the proposal of having access to the software code= =E2=80=A6 However, I feel=0Athat this is more wishful thinking than real sc= ientific life.=E2=80=9D=0A=E2=80=9COpen or closed software (usually) doesn= =E2=80=99t make me any=0Adifference, except from a philosophical perspectiv= e.=E2=80=9D=0A=C2=A0=0AI think of myself as a pragmatic guy. That means tha= t as=0Amuch as I like GPL and open software, I am prepared to pay for a clo= se program that=0Aworks better. And I dare to say most of the computational= chemists try not to=0Adive into the code anyway, as we only want to obtain= an energy value, and not=0Aunderstand how the Roothaan=E2=80=99s equations= were implemented.=0ALet me rescue one thought that appeared (sorry I don= =E2=80=99t=0Aremember the name of the author): =E2=80=9CReproducibility lie= s in the possibility of=0Aobtaining similar values, not necessarily with th= e same program=E2=80=9D. So, if for=0Aexample I wrote down in the paper the= functional and the basis set, that should=0Abe enough for reproducibility = of the energy and geometry (although a good=0Aresearcher should help his co= mmunity by giving much more information). If=0Aprograms X and Y give very d= ifferent bond distances, then clearly one of them=0Ais doing wrong its work= . Who cares about the code if it=E2=80=99s a crappy code anyway?=0AIt is no= t reproducible, and that is death according to the scientific method. =0AI = still think that we must compare ourselves with the=0Aexperimental chemistr= y world, and don=E2=80=99t make a special epistemological rules for=0Acompu= tational science. If I use a specific NMR, nobody will ask me to publish=0A= the designs of the equipment including the Fourier transform algorithm and = the=0Acrystal structure of the superconductor (which I don=E2=80=99t have a= nyway); I must=0Aonly write down the model and frequency of the machine. Sh= ould we ask more from=0Athe theoreticians?=0A=C2=A0=0ALet me be repetitive:= =0A=E2=80=9CI like the idea of open software, and in principle I=0Asupport = the proposal of having access to the software code=E2=80=A6 However, I feel= =0Athat this is more wishful thinking than real scientific life.=E2=80=9D= =0A=C2=A0=0ANow back to my calculations. From time to time we must stop=0Aa= rguing and make a bit of real work.=0A=0ABest,=0ASebastian=0A=0A=0A=0A_____= ___________________________=0AFrom: Sebastian Kozuch kozuchs^_^yahoo.com =0ATo: "Kozuch, Sebastian " =0ASent: Sunday, October 16, 2011 11:05 AM=0ASubject: CCL: Science code = manifesto=0A=0A=0AI like the idea of open software, and in principle I supp= ort=0Athe proposal of having access to the software code for the possibilit= y of full=0Areview of a theoretical work. However, I feel that this is more= wishful=0Athinking than real scientific life.=0A=C2=A0=0ALet=E2=80=99s say= that I receive a manuscript for review that uses=0Aprogram X, which I may = have access to its code now (since I=E2=80=99m the reviewer). Do=0AI have t= o spend a week trying to understand its algorithms to check if the=0Afreque= ncies are correctly calculated? For me, most (if not all) the programs are= =0Ain practice black boxes, as I=E2=80=99m hardly a programmer. Therefore, = open or closed=0Asoftware (usually) doesn=E2=80=99t make me any difference,= except from a philosophical perspective.=0A=0A=C2=A0=0ANow, let me conside= r an analogy from experimental chemistry.=0AI have to review a manuscript w= here the authors tested some compound with a 600=0AMHz NMR. My lab is much = more humble, and I only have access to a 400 MHz NMR.=0AId est, I cannot re= produce the results of the paper that I=E2=80=99m reviewing. Is here=0Aany = difference compared to the theoretical case?=0A=C2=A0=0AI had in a couple o= f cases problems to reproduce the results=0Aof papers, mostly because the a= uthors didn=E2=80=99t provide enough information.=0AHowever, for 100% repro= ducibility I need 100% the same conditions (and a lot of=0Aresources and ti= me). This makes the selection of the software just a small=0Aissue.=0A=C2= =A0=0AI would like to hear the thoughts of other people about this=0Aissue,= as I consider the peer reviewing process a very complex and far from=0Aper= fect system.=0A=C2=A0=0A=0A=0A=0A=0ASebastian=0A=0A=0A=0A__________________= ______________=0AFrom: Brian.James.Duke{:}gmail.com =0ATo: "Kozuch, Sebastian " =0ASent: Friday, October= 14, 2011 11:18 PM=0ASubject: CCL: Science code manifesto=0A=0A=0AI just en= countered the Science Code Manifesto, which essentially states=0Athat all c= omputer code used for scientific analysis and modeling should=0Abe availabl= e for review. It appears to have started with the Climate=0ACode Foundatio= n. I encourage you to visit the web site and consider=0Aendorsing the Mani= festo.=0A=0Ahttp://sciencecodemanifesto.org/=0A=0ANote that this is not spe= cifying open source code, so GAMESS(US),=0AGAMESS(UK), DALTON etc., as well= as open source codes such as PSI3 amd=0AMPQC satisfies the points of the m= anifesto. Of course some other quantum=0Achemistry codes do not. I think we= should be putting pressure on the=0Aauthors of such codes to meet the crit= eria in this manifesto.=0A=0ABrian. --1975645123-57344941-1319046488=:47733 Content-Type: text/html; charset=utf-8 Content-Transfer-Encoding: quoted-printable
=0A=0A
It is clear that the issue of the Science Code Manifesto=0Astirred= a lot of feelings. I must say that I found this topic almost as=0Aenjoyabl= e as the match between Fortran and C++. I learnt a couple of small=0Athings= with this discussion, but all in all I still subscribe to the same ideas= =0AI wrote some days ago:
=0A=0A
 
= =0A=0A
=E2=80=9CI like the idea of open software, a= nd in principle I=0Asupport the proposal of having access to the software c= ode=E2=80=A6 However, I feel=0Athat this is more wishful thinking than real= scientific life.=E2=80=9D
=0A=0A
=E2=80=9COpe= n or closed software (usually) doesn=E2=80=99t make me any=0Adifference, ex= cept from a philosophical perspective.=E2=80=9D
=0A=0A
 
=0A=0A
I think of myself as a = pragmatic guy. That means that as=0Amuch as I like GPL and open software, I= am prepared to pay for a close program that=0Aworks better. And I dare to = say most of the computational chemists try not to=0Adive into the code anyw= ay, as we only want to obtain an energy value, and not=0Aunderstand how the= Roothaan=E2=80=99s equations were implemented.
=0A=0A
Let me rescue one thought that appeared (sorry I don=E2=80=99t=0Ar= emember the name of the author): =E2=80=9CReproducibility lies in the possi= bility of=0Aobtaining similar values, not necessarily with the same program= =E2=80=9D. So, if for=0Aexample I wrote down in the paper the functional an= d the basis set, that should=0Abe enough for reproducibility of the energy = and geometry (although a good=0Aresearcher should help his community by giv= ing much more information). If=0Aprograms X and Y give very different bond = distances, then clearly one of them=0Ais doing wrong its work. Who cares ab= out the code if it=E2=80=99s a crappy code anyway?=0AIt is not reproducible= , and that is death according to the scientific method.
=0A=0A
I still think that we must compare ourselves with the=0Ae= xperimental chemistry world, and don=E2=80=99t make a special epistemologic= al rules for=0Acomputational science. If I use a specific NMR, nobody will = ask me to publish=0Athe designs of the equipment including the Fourier tran= sform algorithm and the=0Acrystal structure of the superconductor (which I = don=E2=80=99t have anyway); I must=0Aonly write down the model and frequenc= y of the machine. Should we ask more from=0Athe theoreticians?
=0A=0A<= div class=3D"MsoNormal"> 
=0A=0A
Let me b= e repetitive:
=0A=0A
=E2=80=9CI like the idea = of open software, and in principle I=0Asupport the proposal of having acces= s to the software code=E2=80=A6 However, I feel=0Athat this is more wishful= thinking than real scientific life.=E2=80=9D
=0A=0A
 
=0A=0A
Now back to my calculatio= ns. From time to time we must stop=0Aarguing and make a bit of real work.=0A=0A

Best,
Sebastian


From: Sebastian Kozuch kozuchs^_^yahoo.com <owner-c= hemistry=-=ccl.net>
To: "Kozuch, Sebastian " <kozuchs=-=yahoo.com>
Sent: Sunday, October 16, 2011 11:05 AM<= br>Subject: CCL: Science c= ode manifesto

=0A
=0A=0A
I lik= e the idea of open software, and in principle I support=0Athe proposal of h= aving access to the software code for the possibility of full=0Areview of a= theoretical work. However, I feel that this is more wishful=0Athinking tha= n real scientific life.
=0A=0A
&nb= sp;
=0A=0A
Let=E2=80=99s say that = I receive a manuscript for review that uses=0Aprogram X, which I may have a= ccess to its code now (since I=E2=80=99m the reviewer). Do=0AI have to spen= d a week trying to understand its algorithms to check if the=0Afrequencies = are correctly calculated? For me, most (if not all) the programs are=0Ain p= ractice black boxes, as I=E2=80=99m hardly a programmer. Therefore, open or= closed=0Asoftware (usually) doesn=E2=80=99t make me any difference, except= from a philosophical perspective.
=0A=0A
 
=0A=0A
Now, let= me consider an analogy from experimental chemistry.=0AI have to review a m= anuscript where the authors tested some compound with a 600=0AMHz NMR. My l= ab is much more humble, and I only have access to a 400 MHz NMR.=0AId est, = I cannot reproduce the results of the paper that I=E2=80=99m reviewing. Is = here=0Aany difference compared to the theoretical case?
=0A=0A
 
=0A=0A
I had in a couple of cases problems to reproduce the results=0Aof = papers, mostly because the authors didn=E2=80=99t provide enough informatio= n.=0AHowever, for 100% reproducibility I need 100% the same conditions (and= a lot of=0Aresources and time). This makes the selection of the software j= ust a small=0Aissue.
=0A=0A
 =
=0A=0A
I would like to hear the t= houghts of other people about this=0Aissue, as I consider the peer reviewin= g process a very complex and far from=0Aperfect system.
=0A=0A
&nb= sp;



Sebastian


From: Bria= n.James.Duke{:}gmail.com <owner-chemistry()ccl.net>
To: "Kozuch, Sebastian " <kozuchs()yah= oo.com>
Sent: Friday,= October 14, 2011 11:18 PM
Subject:= CCL: Science code manifesto

=0A
I just encountered the Science Code Manifesto, which essentially sta= tes
that all computer code used for scientific analysis and modeling sho= uld
be available for review. It appears to have started with the Climat= e
Code Foundation. I encourage you to visit the web site and considerendorsing the Manifesto.

http://sciencecodemanifesto.org/

N= ote that this is not specifying open source code, so GAMESS(US),
GAMESS(= UK), DALTON etc., as well as open source codes such as PSI3 amd
MPQC sat= isfies the points of the manifesto. Of course some other quantum
chemist= ry codes do not. I think we should be putting pressure on the
authors of= such codes to meet the criteria in this manifesto.

Brian.

=


--1975645123-57344941-1319046488=:47733-- From owner-chemistry@ccl.net Wed Oct 19 16:30:01 2011 From: "Sebastian Kozuch kozuchs:-:yahoo.com" To: CCL Subject: CCL: Having fun with the science code manifesto Message-Id: <-45701-111019133928-3782-a7qgesNppDYIfVI6l9LpTA ~~ server.ccl.net> X-Original-From: Sebastian Kozuch Content-Type: multipart/alternative; boundary="270156429-1988330702-1319045960=:31035" Date: Wed, 19 Oct 2011 10:39:20 -0700 (PDT) MIME-Version: 1.0 Sent to CCL by: Sebastian Kozuch [kozuchs=yahoo.com] --270156429-1988330702-1319045960=:31035 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable It is clear that the issue of the Science Code Manifesto=0Astirred a lot of= feelings. I must say that I found this topic almost as=0Aenjoyable as the = match between Fortran and C++. I learnt a couple of small=0Athings with thi= s discussion, but all in all I still subscribe to the same ideas=0AI wrote = some days ago:=0A=C2=A0=0A=E2=80=9CI like the idea of open software, and in= principle I=0Asupport the proposal of having access to the software code= =E2=80=A6 However, I feel=0Athat this is more wishful thinking than real sc= ientific life.=E2=80=9D=0A=E2=80=9COpen or closed software (usually) doesn= =E2=80=99t make me any=0Adifference, except from a philosophical perspectiv= e.=E2=80=9D=0A=C2=A0=0AI think of myself as a pragmatic guy. That means tha= t as=0Amuch as I like GPL and open software, I am prepared to pay for a clo= se program that=0Aworks better. And I dare to say most of the computational= chemists try not to=0Adive into the code anyway, as we only want to obtain= an energy value, and not=0Aunderstand how the Roothaan=E2=80=99s equations= were implemented.=0ALet me rescue one thought that appeared (sorry I don= =E2=80=99t=0Aremember the name of the author): =E2=80=9CReproducibility lie= s in the possibility of=0Aobtaining similar values, not necessarily with th= e same program=E2=80=9D. So, if for=0Aexample I wrote down in the paper the= functional and the basis set, that should=0Abe enough for reproducibility = of the geometry (although a good=0Aresearcher should help his community by = giving much more information). If=0Aprograms X and Y give very different bo= nd distances, then clearly one of them=0Ais doing wrong its work. Who cares= about the code if it=E2=80=99s a crappy code anyway?=0AIt is not reproduci= ble, and that is death according to the scientific method. =0A=0A=0AI still= think that we must compare ourselves with the=0Aexperimental chemistry wor= ld, and don=E2=80=99t make a special epistemological rules for=0Acomputatio= nal science. If I use a specific NMR, nobody will ask me to publish=0Athe d= esigns of the equipment including the Fourier transform algorithm and the= =0Acrystal structure of the superconductor (which I don=E2=80=99t have anyw= ay); I must=0Aonly write down the model and frequency of the machine. Shoul= d we ask more from=0Athe theoreticians?=0A=C2=A0=0ALet me be repetitive:=0A= =E2=80=9CI like the idea of open software, and in principle I=0Asupport the= proposal of having access to the software code=E2=80=A6 However, I feel=0A= that this is more wishful thinking than real scientific life.=E2=80=9D=0A= =C2=A0=0ANow back to my calculations. From time to time we must stop=0Aargu= ing and make a bit of real work.=0ABest,=0ASebastian --270156429-1988330702-1319045960=:31035 Content-Type: text/html; charset=utf-8 Content-Transfer-Encoding: quoted-printable
=0A=0A
It is clear that the issue of the Science Code Manifesto=0Astirred= a lot of feelings. I must say that I found this topic almost as=0Aenjoyabl= e as the match between Fortran and C++. I learnt a couple of small=0Athings= with this discussion, but all in all I still subscribe to the same ideas= =0AI wrote some days ago:
=0A=0A
 
= =0A=0A
=E2=80=9CI like the idea of open software, a= nd in principle I=0Asupport the proposal of having access to the software c= ode=E2=80=A6 However, I feel=0Athat this is more wishful thinking than real= scientific life.=E2=80=9D
=0A=0A
=E2=80=9COpe= n or closed software (usually) doesn=E2=80=99t make me any=0Adifference, ex= cept from a philosophical perspective.=E2=80=9D
=0A=0A
 
=0A=0A
I think of myself as a = pragmatic guy. That means that as=0Amuch as I like GPL and open software, I= am prepared to pay for a close program that=0Aworks better. And I dare to = say most of the computational chemists try not to=0Adive into the code anyw= ay, as we only want to obtain an energy value, and not=0Aunderstand how the= Roothaan=E2=80=99s equations were implemented.
=0A=0A
Let me rescue one thought that appeared (sorry I don=E2=80=99t=0Ar= emember the name of the author): =E2=80=9CReproducibility lies in the possi= bility of=0Aobtaining similar values, not necessarily with the same program= =E2=80=9D. So, if for=0Aexample I wrote down in the paper the functional an= d the basis set, that should=0Abe enough for reproducibility of the geometr= y (although a good=0Aresearcher should help his community by giving much mo= re information). If=0Aprograms X and Y give very different bond distances, = then clearly one of them=0Ais doing wrong its work. Who cares about the cod= e if it=E2=80=99s a crappy code anyway?=0AIt is not reproducible, and that = is death according to the scientific method.

=0A=0A
I still think that we must co= mpare ourselves with the=0Aexperimental chemistry world, and don=E2=80=99t = make a special epistemological rules for=0Acomputational science. If I use = a specific NMR, nobody will ask me to publish=0Athe designs of the equipmen= t including the Fourier transform algorithm and the=0Acrystal structure of = the superconductor (which I don=E2=80=99t have anyway); I must=0Aonly write= down the model and frequency of the machine. Should we ask more from=0Athe= theoreticians?
=0A=0A
 
=0A=0A
Let me be repetitive:
=0A=0A
=E2=80=9CI like the idea of open software, and in principle I=0Asupport = the proposal of having access to the software code=E2=80=A6 However, I feel= =0Athat this is more wishful thinking than real scientific life.=E2=80=9D=0A=0A
 
=0A=0A
Now back to my calculations. From time to time we must stop=0Aarguing an= d make a bit of real work.
=0A=0A
Best,
Sebastian

--270156429-1988330702-1319045960=:31035-- From owner-chemistry@ccl.net Wed Oct 19 17:05:00 2011 From: "Alexander Bagaturyants bagaturyants{=}gmail.com" To: CCL Subject: CCL:G: Science code manifesto Message-Id: <-45702-111019142132-14714-9blRTthdKrhL5CmIgEtDLQ++server.ccl.net> X-Original-From: "Alexander Bagaturyants" Content-Language: ru Content-Transfer-Encoding: 8bit Content-Type: text/plain; charset="UTF-8" Date: Wed, 19 Oct 2011 22:21:23 +0400 MIME-Version: 1.0 Sent to CCL by: "Alexander Bagaturyants" [bagaturyants-#-gmail.com] Dear CCLers, I believe that the current discussion becomes counterproductive. Of course, the author may decide whether to give an access to one's code or not. It is the author's inherent right. Actually, the code itself is necessary if (when) somebody wants to use it somehow in his/her future work. > From the pure scientific point of view, we need only the results of calculations and their verification. However, some authors are so inadequate that they even protest against any comparison or benchmarking of their codes and the results obtained. I believe that the scientific community must strongly oppose such a position. It has nothing to do with an honest attitude to science at all. Any code must be open to any comparison, verification and benchmarking, and the results of such a comparison, verification or benchmarking must be fully available to scientific community. > -----Original Message----- > From: owner-chemistry+sasha==photonics.ru---ccl.net [mailto:owner- > chemistry+sasha==photonics.ru---ccl.net] On Behalf Of George Fitzgerald > George.Fitzgerald/./accelrys.com > Sent: Wednesday, October 19, 2011 6:50 PM > To: Багатурьянц Александр Александрович > Subject: CCL:G: Science code manifesto > > > Sent to CCL by: George Fitzgerald [George.Fitzgerald[A]accelrys.com] > As a member of a company that makes money from selling software, I > probably have a different outlook on this than most CCLers. But I have > one very practical question: as a reviewer, do you really have the time > and expertise to review 1000s of lines of source code? I find that > properly reviewing a paper already takes several hours. From experience > I know that reviewing somebody's source code can take days. > > Can anybody give me an example of what you'd even look for in the > source code? I'm thinking back to, for example, Peter Gill's 'PRISM' > method for Gaussian integration, or Benny Johnson and DFT analytic 2nd > derivatives. Are you claiming that those papers shouldn’t have been > published without the reviewer reviewing the code? > > -george> To recover the email address of the author of the message, please > change> From owner-chemistry@ccl.net Wed Oct 19 17:41:00 2011 From: "Serdar Badoglu sbadoglu##gazi.edu.tr" To: CCL Subject: CCL:G: Macro or something for Gaussian 03 Message-Id: <-45703-111019145621-25935-m0zlEuFpBkvDHi426m//yA ~~ server.ccl.net> X-Original-From: "Serdar Badoglu" Date: Wed, 19 Oct 2011 14:56:17 -0400 Sent to CCL by: "Serdar Badoglu" [sbadoglu-$-gazi.edu.tr] Hi CCL'ers, Do any of you have / know an excel macro / a little program to extract calculated frequencies, infrared intensities, and Raman activities from Gaussian log file? I don't need the complete spectrum generated, I only need the mentioned data of vibrational modes. I'm really tired of switching between windows and reading and typing the data into spreadsheets. Regards. From owner-chemistry@ccl.net Wed Oct 19 18:27:00 2011 From: "Laurence Cuffe cuffe^_^mac.com" To: CCL Subject: CCL:G: Science code manifesto Message-Id: <-45704-111019170051-2966-IdwwST7Z5mJR9wy+WDCrRQ^server.ccl.net> X-Original-From: Laurence Cuffe Content-type: multipart/alternative; boundary="Boundary_(ID_o2MbhioYPbvve8LNeVXD7w)" Date: Wed, 19 Oct 2011 21:00:36 +0000 (GMT) MIME-version: 1.0 Sent to CCL by: Laurence Cuffe [cuffe[-]mac.com] --Boundary_(ID_o2MbhioYPbvve8LNeVXD7w) Content-type: text/plain; charset=utf-8; format=flowed Content-transfer-encoding: quoted-printable On Oct 19, 2011, at 03:41 PM, "Jaroslav Kalinovski kofeinu:+:hotmail.com" = wrote: > > Sent to CCL by: Jaroslav Kalinovski [kofeinu-$-hotmail.com] > > Hi, > I really do not understand... for me, the source code I am writing is li= ke an equipment in the laboratory. Do you really need someone else equipme= nt to make a proper review? Maybe in this case it is easy to send the code= but rules are the same. Do you ask for particle accelerator when reviewin= g paper about experiment with one? > > If someone is having troubles with reproducing my results, one always ca= n write to me and simply ask for the code but I do not feel I HAVE TO publ= ish code explicitly. In the end it is my property, I can describe algorith= m, points of theory but why should I give the code? No one is watching at = the hands of experimentalists while reviewing their papers. > > I think people are forgetting that code is just a tool not a research re= sult. > > It is true that relying only on reputation is not the perfect way but it= works for other disciplines. In the end we always have to put on some tru= st in authors. > That is all for me. > > Best regards, > J.Kalinowski =20 And that's exactly it, you don't have to publish your source code, but you= should be prepared to let someone look over it if they ask. Also you don't have to let them use it, you can protect that by copyright,= but people should be able to look over it and satisfy themselves that it = should do what you say it will. The right to look should be free, the rig= ht to run or license the code can be whatever you think the market can bea= r. The call for openness is in the context of devising code to analyze large = data sets in the context of climate change. In this context we should not = be comfortable with a researcher telling us that they wrote a program to e= xtract the key data, who then asserts that they wont let us see how it wor= ks. I don't expect a reviewer to check the code implementation of every compu= tational development that apears, this would be a very onerous task, and w= ould not be realistic. However if a problem arises where a number of progr= ams claim to be calculating the same thing, but get different results, the= n I think the scientific community would be best served if they knew that = the source code was available for their perusal. There may be a case for certain restrictions to apply, but these restricti= ons and limitations should be clearly flagged in any resulting scientific = papers. A very hypothetical example of this might be where a researcher w= as working for a pharmaceutical company which had developed proprietary so= ftware for developing leads in drug discovery. If the researcher wished to= pubish details of research in which a number of potential drug candidates= had been developed by applying the software to orphan drug discovery, wit= hout giving details of precisely how they had been developed then it=EF=BB= =BF might be appropriate to flag this and publish the leads as a matter of= public interest. One final point, open access to your code does allow ones rivals to see wh= at and how you have written it, but it also allows you to examine their co= de and identify cases where they have stolen your IP. All the best Laurence Cuffe > > --- > Laboratory of Physical Chemistry > University of Helsinki > --- > Logic is a systematic method of coming to the wrong conclusion with conf= idence. > > > ---------------------------------------- > > From: owner-chemistry- -ccl.net > > To: kofeinu- -hotmail.com > > Subject: CCL:G: Science code manifesto > > Date: Wed, 19 Oct 2011 07:50:08 -0700 > > > > > > Sent to CCL by: George Fitzgerald [George.Fitzgerald[A]accelrys.com] > > As a member of a company that makes money from selling software, I pro= bably have a different outlook on this than most CCLers. But I have one ve= ry practical question: as a reviewer, do you really have the time and expe= rtise to review 1000s of lines of source code? I find that properly review= ing a paper already takes several hours. From experience I know that revie= wing somebody's source code can take days. > > > > Can anybody give me an example of what you'd even look for in the sour= ce code? I'm thinking back to, for example, Peter Gill's 'PRISM' method fo= r Gaussian integration, or Benny Johnson and DFT analytic 2nd derivatives.= Are you claiming that those papers shouldn=E2=80=99t have been published = without the reviewer reviewing the code? > > > > -george> > > > > > > -=3D This is automatically added to each message by the mailing script =3D= -> > --Boundary_(ID_o2MbhioYPbvve8LNeVXD7w) Content-type: multipart/related; boundary="Boundary_(ID_xjgkfyRBHDO5WrGT+yR05w)"; type="text/html" --Boundary_(ID_xjgkfyRBHDO5WrGT+yR05w) Content-type: text/html; charset=windows-1252 Content-transfer-encoding: quoted-printable


On Oct 19, 2011, at 03:41 PM, "Jaroslav Kalinovski= kofeinu:+:hotmail.com" <owner-chemistry_._ccl.net> wrote:


=0ASent to CCL by: Jaroslav Kalinovski [kofeinu-$-hotmail.com]
=0A=
=0AHi,
=0AI really do not understand... for me, the source code I a= m writing is like an equipment in the laboratory. Do you really need someo= ne else equipment to make a proper review? Maybe in this case it is easy t= o send the code but rules are the same. Do you ask for particle accelerato= r when reviewing paper about experiment with one?
=0A
=0AIf someone= is having troubles with reproducing my results, one always can write to m= e and simply ask for the code but I do not feel I HAVE TO publish code exp= licitly. In the end it is my property, I can describe algorithm, points of= theory but why should I give the code? No one is watching at the hands of= experimentalists while reviewing their papers.
=0A
=0AI think peopl= e are forgetting that code is just a tool not a research result.
=0A=0AIt is true that relying only on reputation is not the perfect way but = it works for other disciplines. In the end we always have to put on some t= rust in authors.
=0AThat is all for me.
=0A
=0ABest regards,=0AJ.Kalinowski
 
And that's exactly it, you don't have to publish your source code, but yo= u should be prepared to let someone look over it if they ask.
Also you don't have to let them use it, you can protect that by copyrigh= t, but people should be able to look over it and satisfy themselves that i= t should do what you say it will.  The right to look should be free, = the right to run or license the code can be whatever you think the market = can bear.

The call for openness is in the con= text of devising code to analyze large data sets in the context of climate= change. In this context we should not be comfortable with a researcher te= lling us that they wrote a program to extract the key data, who then asser= ts that they wont let us see how it works.

&n= bsp;I don't expect a reviewer to check the code implementation of every co= mputational development that apears, this would be a very onerous task, an= d would not be realistic. However if a problem arises where a number of pr= ograms claim to be calculating the same thing, but get different results, = then I think the scientific community would be best served if they knew th= at the source code was available for their perusal.

There may be a case for certain restrictions to apply, but these re= strictions and limitations should be clearly flagged in any resulting scie= ntific papers.  A very hypothetical example of this might be where a = researcher was working for a pharmaceutical company which had developed pr= oprietary software for developing leads in drug discovery. If the research= er wished to pubish details of research in which a number of potential dru= g candidates had been developed by applying the software to orphan drug di= scovery, without giving details of precisely how they had been developed t= hen it might be appropriate to flag this and publish the leads as a matter= of public interest.

One final point, open ac= cess to your code does allow ones rivals to see what and how you have writ= ten it, but it also allows you to examine their code and identify cases wh= ere they have stolen your IP.

All the best

Laurence Cuffe

=0A---
=0ALaboratory = of Physical Chemistry
=0AUniversity of Helsinki
=0A---
=0ALogic i= s a systematic method of coming to the wrong conclusion with confidence.=0A
=0A
=0A----------------------------------------
=0A> F= rom: owner-chemistry- -ccl.net
=0A> To: kofeinu- -hotmail.com
=0A= > Subject: CCL:G: Science code manifesto
=0A> Date: Wed, 19 Oct 2= 011 07:50:08 -0700
=0A>
=0A>
=0A> Sent to CCL by: George= Fitzgerald [George.Fitzgerald[A]accelrys.com]
=0A> As a member of a= company that makes money from selling software, I probably have a differe= nt outlook on this than most CCLers. But I have one very practical questio= n: as a reviewer, do you really have the time and expertise to review 1000= s of lines of source code? I find that properly reviewing a paper already = takes several hours. From experience I know that reviewing somebody's sour= ce code can take days.
=0A>
=0A> Can anybody give me an exampl= e of what you'd even look for in the source code? I'm thinking back to, fo= r example, Peter Gill's 'PRISM' method for Gaussian integration, or Benny = Johnson and DFT analytic 2nd derivatives. Are you claiming that those pape= rs shouldn=92t have been published without the reviewer reviewing the code= ?
=0A>
=0A> -george>
=0A>
=0A
=0A
=0A
=0A= -=3D This is automatically added to each message by the mailing script =3D= -
=0ATo recover the email address of the author of the message, please = change
=0Athe strange characters on the top line to the _._ sign. You can= also
=0A
=0A=0AE-mail to subscribers: CHEMISTRY_._ccl.net or use:
=0A http://www.ccl.net/cgi-bin/c= cl/send_ccl_message
=0A
=0AE-mail to administrators: CHEMISTRY-REQUEST_._ccl.net or use
=0A http://www.ccl.net/cgi-bin/ccl/send_ccl_mess= age
=0A
=0A
=0A http://www.ccl.net/chemistry/sub_unsub.shtml<= br>=0A
=0ABefore posting, check wait time at: http://www.ccl.net
=0A
=0A= Job: http://www.ccl.net/jobs
=0AConferences: http://server.ccl.net/che= mistry/announcements/conferences/
=0A
=0ASearch Messages: http://www.ccl.net/chemist= ry/searchccl/index.shtml
=0A
=0AIf your mail bounces from CCL wi= th 5.7.1 error, check:
=0A http://www.ccl.net/spam= mers.txt
=0A
=0ARTFI: http://www.ccl.net/chemistry/aboutccl/instructions/
=0A=
=0A
=0A
= --Boundary_(ID_xjgkfyRBHDO5WrGT+yR05w)-- --Boundary_(ID_o2MbhioYPbvve8LNeVXD7w)-- From owner-chemistry@ccl.net Wed Oct 19 19:02:00 2011 From: "David A Mannock dmannock- -ualberta.ca" To: CCL Subject: CCL: Science code manifesto - just publish the code Message-Id: <-45705-111019182922-23375-B3NBcWbz8mxnSOEpBNuh6w[#]server.ccl.net> X-Original-From: David A Mannock Content-Type: multipart/alternative; boundary=001485f44bf0a3204d04afae5ee4 Date: Wed, 19 Oct 2011 16:29:13 -0600 MIME-Version: 1.0 Sent to CCL by: David A Mannock [dmannock- -ualberta.ca] --001485f44bf0a3204d04afae5ee4 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Konrad, As I said in my reply to Mark,, maybe that is something that we can encourage here among the readers of the forum. I have one friend, a carbohydrate chemist here, who had one paper in an ACS journal with 109 supplementary pages of conformational analysis. I gave him a hard time in a humorous way, but the data had value to others in the area. Dave On Wed, Oct 19, 2011 at 1:16 AM, Konrad Hinsen hinsen^^cnrs-orleans.fr < owner-chemistry-x-ccl.net> wrote: > > Sent to CCL by: Konrad Hinsen [hinsen**cnrs-orleans.fr] > On 18 oct. 11, at 20:32, Vincent Leroux vincent.leroux- -loria.fr wrote: > > Wherever access to the code is necessary to reproduce results, it *must* >> be provided one way or another (e.g. as supporting information) if it is= not >> freely available already, this is just plain obvious to me. >> >> Wondering why a manifesto would be needed in order to do just that... >> > > It is needed because "just that" does not happen today. I get a lot of > computational science papers to review, but none of them had any code > attached (not even small analysis scripts), nor any explanation of how I > could obtain it. Even for my own articles, I rarely provide scripts becau= se > most journals don't want them. One day I'll put all that on my Web site. > > As a reviewer, I insist that authors provide a full explanation of all > algorithms they use (usually through literature references of course), an= d > that they state which versions of which programs were used. I often ask t= hem > to consider making code and data available, but that has happened only in > one case. I would like to be able to request such information more firmly= , > but I don't see much of a basis for that at the moment: it's neither a > general habit nor something encouraged by the author guidelines of the ma= jor > scientific journals. > -- > ------------------------------**------------------------------**--------- > Konrad Hinsen > Centre de Biophysique Mol=E9culaire, CNRS Orl=E9ans > Synchrotron Soleil - Division Exp=E9riences > Saint Aubin - BP 48 > 91192 Gif sur Yvette Cedex, France > Tel. +33-1 69 35 97 15 > E-Mail: research at khinsen dot fastmail dot net > http://dirac.cnrs-orleans.fr/~**hinsen/ > ------------------------------**------------------------------**--------- > > > > -=3D This is automatically added to each message by the mailing script = =3D-> http://www.ccl.net/cgi-bin/**ccl/send_ccl_message http://www.ccl.net/cgi-bin/**ccl/send_ccl_message > Job: http://www.ccl.net/**jobsConferences: > http://server.ccl.net/**chemistry/announcements/**conferences/ > > Search Messages: http://www.ccl.net/chemistry/**searchccl/index.shtml http://www.ccl.net/spammers.**txt > > RTFI: http://www.ccl.net/chemistry/**aboutccl/instructions/ > > > --001485f44bf0a3204d04afae5ee4 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Konrad, As I said in my reply to Mark,, maybe that is something that we can= encourage here among the readers of the forum. I have one friend, a carboh= ydrate chemist here, who had one paper in an ACS journal with 109 supplemen= tary pages of conformational analysis. I gave him a hard time in a humorous= way, but the data had value to others in the area. Dave

On Wed, Oct 19, 2011 at 1:16 AM, Konrad Hins= en hinsen^^cnrs-orleans.fr <owner-chemistry-x-ccl= .net> wrote:

Sent to CCL by: Konrad Hinsen [hinsen**cnrs-orleans.fr]
On 18 oct. 11, at 20:32, Vincent Leroux vincent.leroux- -loria.fr wrote:

Wherever access to the code is necessary to reproduce results, it *must* be= provided one way or another (e.g. as supporting information) if it is not = freely available already, this is just plain obvious to me.

Wondering why a manifesto would be needed in order to do just that...

It is needed because "just that" does not happen today. I get a l= ot of computational science papers to review, but none of them had any code= attached (not even small analysis scripts), nor any explanation of how I c= ould obtain it. Even for my own articles, I rarely provide scripts because = most journals don't want them. One day I'll put all that on my Web = site.

As a reviewer, I insist that authors provide a full explanation of all algo= rithms they use (usually through literature references of course), and that= they state which versions of which programs were used. I often ask them to= consider making code and data available, but that has happened only in one= case. I would like to be able to request such information more firmly, but= I don't see much of a basis for that at the moment: it's neither a= general habit nor something encouraged by the author guidelines of the maj= or scientific journals.
--
-------------------------------------------------------------= --------
Konrad Hinsen
Centre de Biophysique Mol=E9culaire, CNRS Orl=E9ans
Synchrotron Soleil - Division Exp=E9riences
Saint Aubin - BP 48
91192 Gif sur Yvette Cedex, France
Tel. +33-1 69 35 97 15
E-Mail: research at =A0khinsen dot fastmail dot net
http:= //dirac.cnrs-orleans.fr/~hinsen/
-------------------------------------------------------------= --------



-=3D This is automatically added to each message by the mailing script =3D-=
E-mail to subscribers: CHEMISTRY-x-ccl.net or use:
=A0 =A0 http://www.ccl.net/cgi-bin/ccl/send_ccl_message

E-mail to administrators: CHEMISTRY-REQUEST-x-ccl.net or use
=A0 =A0 http://www.ccl.net/cgi-bin/ccl/send_ccl_message
http://www.ccl.net/chemistry/sub_unsu= b.shtml

Before posting, check wait time at: http://www.ccl.net

Job: http:= //www.ccl.net/jobsConferences: http://server.ccl.n= et/chemistry/announcements/conferences/

Search Messages: http://www.ccl.net/chemistry/searchccl/index.= shtml
=A0 =A0 http= ://www.ccl.net/spammers.txt

RTFI: http://www.ccl.net/chemistry/aboutccl/instructions/=



--001485f44bf0a3204d04afae5ee4-- From owner-chemistry@ccl.net Wed Oct 19 19:37:00 2011 From: "Michel Petitjean petitjean.chiral(-)gmail.com" To: CCL Subject: CCL: Science code manifesto Message-Id: <-45706-111019184607-6693-5cY3l86rit5A2Rgsg74OWQ^_^server.ccl.net> X-Original-From: Michel Petitjean Content-Type: text/plain; charset=ISO-8859-1 Date: Wed, 19 Oct 2011 18:45:42 -0400 MIME-Version: 1.0 Sent to CCL by: Michel Petitjean [petitjean.chiral[#]gmail.com] My contribution to the thread: Most reviewers are even unable to check most basic math formulas, so I hardly imagine them reading thousands of source lines including library calls (my own ones are in fortran, and commented in French !), thus I agree with George Fitzgerald. Despite that I give for free some of my sources, I hardly imagine giving the result of many years of my work to somebody without any counterpart (what should think my employer, who pays me ?). Publishing clearly the algorithms and giving a free access to the binaries should suffice to check. And even in this situation, it can be a hard work: checking the result sometimes need to compute it with an other algorithm (when existing !!), and it is much work. Once programmed, checking that the result is the same may be more difficult than generating the result itself. Some examples illustrating various difficulties for checking the results: Example 1: generate a d-dimensional Voronoi diagram. Example 2: calculate the minimal radius cylinder enclosing N points. Example 3: calculate analytically the vdw surface of N intersecting spheres. People ready to do that are welcome to contact me. And please be sure that I would be delighted to learn about comparisons, at least for ex. 2 and 3. All my best, Michel Petitjean MTi, INSERM UMR-S 973, University Paris 7 35 rue Helene Brion, 75205 Paris Cedex 13, France. Phone: +331 5727 8434; Fax: +331 5727 8372 E-mail: petitjean.chiral**gmail.com (preferred), michel.petitjean**univ-paris-diderot.fr http://petitjeanmichel.free.fr/itoweb.petitjean.freeware.html 2011/10/19 Alexander Bagaturyants bagaturyants{=}gmail.com : > > Sent to CCL by: "Alexander Bagaturyants" [bagaturyants-#-gmail.com] > Dear CCLers, > I believe that the current discussion becomes counterproductive. > Of course, the author may decide whether to give an access to one's code or not. It is the author's inherent right. > Actually, the code itself is necessary if (when) somebody wants to use it somehow in his/her future work. >> From the pure scientific point of view, we need only the results of calculations and their verification. > However, some authors are so inadequate that they even protest against any comparison or benchmarking of their codes and the results obtained. > I believe that the scientific community must strongly oppose such a position. It has nothing to do with an honest attitude to science at all. > Any code must be open to any comparison, verification and benchmarking, and the results of such a comparison, verification or benchmarking must be fully available to scientific community. > From owner-chemistry@ccl.net Wed Oct 19 20:11:00 2011 From: "Dario Fernando Coral ferchocoralg16 .. gmail.com" To: CCL Subject: CCL:G: AIM for molecular ion Message-Id: <-45707-111019192026-27612-n5dikaJA3MhSysYXvPD2IQ_._server.ccl.net> X-Original-From: "Dario Fernando Coral" Date: Wed, 19 Oct 2011 19:20:22 -0400 Sent to CCL by: "Dario Fernando Coral" [ferchocoralg16__gmail.com] regards Does anyone know how to perform AIM calculations in gaussian 09 or 03 for a molecular ion? I have performed this calculation for the neutral water molecule, but when I use AIM command on the furfuryl-aniline molecular ion, gaussian 09 has an error in the link L609 The route card I used is as follows: # ub3lyp/631g(d) AIM=charge test thanks in advance From owner-chemistry@ccl.net Wed Oct 19 21:25:00 2011 From: "David A Mannock dmannock*_*ualberta.ca" To: CCL Subject: CCL: Science code manifesto Message-Id: <-45708-111019183934-804-I0n5Hm8VsIH6pEEkWXOe1A\a/server.ccl.net> X-Original-From: David A Mannock Content-Type: multipart/alternative; boundary=001485f44bf0c5803604afae82d4 Date: Wed, 19 Oct 2011 16:39:19 -0600 MIME-Version: 1.0 Sent to CCL by: David A Mannock [dmannock]|[ualberta.ca] --001485f44bf0c5803604afae82d4 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Andy, you have made many good points here. I wonder if the solution, given that many academics have access to online resources through their universit= y libraries, would be to allow access to institutions with licences to commercialized code and to those with their own local licence. No licence = =3D no access. Just an idea. It would also exclude the great unwashed from the equation, although it would not prevent piracy. Dave On Wed, Oct 19, 2011 at 3:38 AM, Andrew Dalke dalke a dalkescientific.com < owner-chemistry~~ccl.net> wrote: > > Sent to CCL by: Andrew Dalke [dalke^^^dalkescientific.com] > On Oct 18, 2011, at 4:17 PM, Adri=E0 Cereto Massagu=E9 adrian.cereto,+, > gmail.com wrote: > > I don't think the manifesto is at odds with FSF. GPL'd software can be > sold at any price, but its source code must be available for those who ow= n > the software at no further cost. And someone who has bought some GPL > software is allowed to redistribute it for free, so researchers using it = for > a paper would be able to provide the software to reviewers and readers of > the paper at no cost. > > Abstract: How much can the paper authors ask for access to the > source code? How much can the curators charge? What should the > curator do if the curated software contains a license violation? > > > If I write a paper which depends on software for its analysis, > and others should have access to the software as part of effective > peer review, then how much can I charge others to get access to > the software? US $1 billion? > > The FSF says I can charge as much as you want, and that freedom > is one of the core freedoms of free software. > > The philosophy that others need access to my source code to > provide good peer review has the implicit assumption that I > will provide the software at a non-prohibitive cost. > > There is clearly a tension between these two viewpoints. This > manifesto says nothing of what that cost might be, nor even > that it might be an issue. > > > What should be the cost to get access to source code from the > author, or from the curator? Does the curator get no-cost access > to it as a condition of publication? Doesn't any limit on cost > curtail what the FSF says is my freedom to charge as much as > I want? > > Remember, the FSF encourages software freedom. I argue that > scientific communication has overlapping but different goals. > Science communications needs to have a low cost so that many > people can get access to it. The FSF is only concerned about > what happens *after* someone gets access to software. > > > This is of course similar to (most) scientific papers. There > the author gives the curator the right to redistribute the paper > without paying royalties, and the curator can charge effectively > any price for it. Most paper publishers want to maximize revenue, > and therefore set high but not prohibitive prices. The software > author may have other concerns. > > > Interestingly, the software curator takes on a more difficult challenge > than a paper curator. The authors of a paper (with a few exceptions > usually well-covered by fair use exceptions) are the only copyright > holders of a paper. More often though, the accompanying software has > many more copyright holders. That can lead to problems. > > Consider the CDK chemistry toolkit. The package contains many > copyright holders, including those from third-party libraries > which it incorporates. A few years ago the CDK was in minor > violation of the LGPL requirement of some of those libraries. > (It omitted the credit required by those licenses.) This was > quickly fixed once pointed out. I can easily imagine cases > where it can't be easily fixed. > > The curator takes on the risk that someone else, who is a > copyright holder to the software in question but not a paper > author, may challenge the right of the curator to distribute > the software. How does the curator resolve the violation, > especially if the original author doesn't want to be involved? > Does the curator remove the software in question? > > If so, and if you insist that the software must be available in > order to do correct peer review, then should the corresponding paper > also be withdrawn? > > As I said before, these are solvable. I bring them up because > I encourage people to distribute their source code along with > the paper, and to be aware that it's not a simple, clear issue. > > > Andrew > dalke:+:dalkescientific.com > > > > -=3D This is automatically added to each message by the mailing script = =3D-> > > --001485f44bf0c5803604afae82d4 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Andy, you have made many good points here. I wonder if the solution, given = that many academics have access to online resources through their universit= y libraries, would be to allow access to institutions with licences to comm= ercialized code and to those with their own local licence. No licence =3D n= o access. Just an idea. It would also exclude the great unwashed from the e= quation, although it would not prevent piracy. Dave

On Wed, Oct 19, 2011 at 3:38 AM, Andrew Dalk= e dalke a dalkescientific.com <owner-chemis= try~~ccl.net> wrote:

Sent to CCL by: Andrew Dalke [dalke^^^dalkescientific.com]
On Oct 18, 2011, at 4:17 PM, Adri=E0 Cereto Massagu=E9 adrian.cereto,+,gmail.com wrote:
> I don't think the manifesto is at odds with FSF. GPL'd softwar= e can be sold at any price, but its source code must be available for those= who own the software at no further cost. And someone who has bought some G= PL software is allowed to redistribute it for free, so researchers using it= for a paper would be able to provide the software to reviewers and readers= of the paper at no cost.

Abstract: How much can the paper authors ask for access to the
source code? How much can the curators charge? What should the
curator do if the curated software contains a license violation?


If I write a paper which depends on software for its analysis,
and others should have access to the software as part of effective
peer review, then how much can I charge others to get access to
the software? US $1 billion?

The FSF says I can charge as much as you want, and that freedom
is one of the core freedoms of free software.

The philosophy that others need access to my source code to
provide good peer review has the implicit assumption that I
will provide the software at a non-prohibitive cost.

There is clearly a tension between these two viewpoints. This
manifesto says nothing of what that cost might be, nor even
that it might be an issue.


What should be the cost to get access to source code from the
author, or from the curator? Does the curator get no-cost access
to it as a condition of publication? Doesn't any limit on cost
curtail what the FSF says is my freedom to charge as much as
I want?

Remember, the FSF encourages software freedom. I argue that
scientific communication has overlapping but different goals.
Science communications needs to have a low cost so that many
people can get access to it. The FSF is only concerned about
what happens *after* someone gets access to software.


This is of course similar to (most) scientific papers. There
the author gives the curator the right to redistribute the paper
without paying royalties, and the curator can charge effectively
any price for it. Most paper publishers want to maximize revenue,
and therefore set high but not prohibitive prices. The software
author may have other concerns.


Interestingly, the software curator takes on a more difficult challenge
than a paper curator. The authors of a paper (with a few exceptions
usually well-covered by fair use exceptions) are the only copyright
holders of a paper. More often though, the accompanying software has
many more copyright holders. That can lead to problems.

Consider the CDK chemistry toolkit. The package contains many
copyright holders, including those from third-party libraries
which it incorporates. A few years ago the CDK was in minor
violation of the LGPL requirement of some of those libraries.
(It omitted the credit required by those licenses.) This was
quickly fixed once pointed out. I can easily imagine cases
where it can't be easily fixed.

The curator takes on the risk that someone else, who is a
copyright holder to the software in question but not a paper
author, may challenge the right of the curator to distribute
the software. How does the curator resolve the violation,
especially if the original author doesn't want to be involved?
Does the curator remove the software in question?

If so, and if you insist that the software must be available in
order to do correct peer review, then should the corresponding paper
also be withdrawn?

As I said before, these are solvable. I bring them up because
I encourage people to distribute their source code along with
the paper, and to be aware that it's not a simple, clear issue.


=A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0Andrew
=A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0dalke:+:dalkescientific.com



-=3D This is automatically added to each message by the mailing script =3D-=
E-mail to subscribers:
CHEMISTRY~~ccl.n= et or use:
=A0 =A0 =A0http://www.ccl.net/cgi-bin/ccl/send_ccl_message

E-mail to administrators: CHEM= ISTRY-REQUEST~~ccl.net or use
=A0 =A0 =A0http://www.ccl.net/cgi-bin/ccl/send_ccl_message

Subscribe/Unsubscribe:
=A0 =A0 =A0http://www.ccl.net/chemistry/sub_unsub.shtml

Before posting, check wait time at: http://www.ccl.net

Job: http://www.ccl.n= et/jobs
Conferences: http://server.ccl.net/chemistry/announcements/co= nferences/

Search Messages: http://www.ccl.net/chemistry/searchccl/index.shtml
=A0 =A0 =A0
h= ttp://www.ccl.net/spammers.txt

RTFI: http://www.ccl.net/chemistry/aboutccl/instructions/



--001485f44bf0c5803604afae82d4--