source: sasmodels/sasmodels/generate.py @ f619de7

core_shell_microgelscostrafo411magnetic_modelrelease_v0.94release_v0.95ticket-1257-vesicle-productticket_1156ticket_1265_superballticket_822_more_unit_tests
Last change on this file since f619de7 was f619de7, checked in by Paul Kienzle <pkienzle@…>, 8 years ago

more type hinting

  • Property mode set to 100644
File size: 22.5 KB
Line 
1"""
2SAS model constructor.
3
4Small angle scattering models are defined by a set of kernel functions:
5
6    *Iq(q, p1, p2, ...)* returns the scattering at q for a form with
7    particular dimensions averaged over all orientations.
8
9    *Iqxy(qx, qy, p1, p2, ...)* returns the scattering at qx, qy for a form
10    with particular dimensions for a single orientation.
11
12    *Imagnetic(qx, qy, result[], p1, p2, ...)* returns the scattering for the
13    polarized neutron spin states (up-up, up-down, down-up, down-down) for
14    a form with particular dimensions for a single orientation.
15
16    *form_volume(p1, p2, ...)* returns the volume of the form with particular
17    dimension, or 1.0 if no volume normalization is required.
18
19    *ER(p1, p2, ...)* returns the effective radius of the form with
20    particular dimensions.
21
22    *VR(p1, p2, ...)* returns the volume ratio for core-shell style forms.
23
24    #define INVALID(v) (expr)  returns False if v.parameter is invalid
25    for some parameter or other (e.g., v.bell_radius < v.radius).  If
26    necessary, the expression can call a function.
27
28These functions are defined in a kernel module .py script and an associated
29set of .c files.  The model constructor will use them to create models with
30polydispersity across volume and orientation parameters, and provide
31scale and background parameters for each model.
32
33*Iq*, *Iqxy*, *Imagnetic* and *form_volume* should be stylized C-99
34functions written for OpenCL.  All functions need prototype declarations
35even if the are defined before they are used.  OpenCL does not support
36*#include* preprocessor directives, so instead the list of includes needs
37to be given as part of the metadata in the kernel module definition.
38The included files should be listed using a path relative to the kernel
39module, or if using "lib/file.c" if it is one of the standard includes
40provided with the sasmodels source.  The includes need to be listed in
41order so that functions are defined before they are used.
42
43Floating point values should be declared as *double*.  For single precision
44calculations, *double* will be replaced by *float*.  The single precision
45conversion will also tag floating point constants with "f" to make them
46single precision constants.  When using integral values in floating point
47expressions, they should be expressed as floating point values by including
48a decimal point.  This includes 0., 1. and 2.
49
50OpenCL has a *sincos* function which can improve performance when both
51the *sin* and *cos* values are needed for a particular argument.  Since
52this function does not exist in C99, all use of *sincos* should be
53replaced by the macro *SINCOS(value, sn, cn)* where *sn* and *cn* are
54previously declared *double* variables.  When compiled for systems without
55OpenCL, *SINCOS* will be replaced by *sin* and *cos* calls.   If *value* is
56an expression, it will appear twice in this case; whether or not it will be
57evaluated twice depends on the quality of the compiler.
58
59If the input parameters are invalid, the scattering calculator should
60return a negative number. Particularly with polydispersity, there are
61some sets of shape parameters which lead to nonsensical forms, such
62as a capped cylinder where the cap radius is smaller than the
63cylinder radius.  The polydispersity calculation will ignore these points,
64effectively chopping the parameter weight distributions at the boundary
65of the infeasible region.  The resulting scattering will be set to
66background.  This will work correctly even when polydispersity is off.
67
68*ER* and *VR* are python functions which operate on parameter vectors.
69The constructor code will generate the necessary vectors for computing
70them with the desired polydispersity.
71The kernel module must set variables defining the kernel meta data:
72
73    *id* is an implicit variable formed from the filename.  It will be
74    a valid python identifier, and will be used as the reference into
75    the html documentation, with '_' replaced by '-'.
76
77    *name* is the model name as displayed to the user.  If it is missing,
78    it will be constructed from the id.
79
80    *title* is a short description of the model, suitable for a tool tip,
81    or a one line model summary in a table of models.
82
83    *description* is an extended description of the model to be displayed
84    while the model parameters are being edited.
85
86    *parameters* is the list of parameters.  Parameters in the kernel
87    functions must appear in the same order as they appear in the
88    parameters list.  Two additional parameters, *scale* and *background*
89    are added to the beginning of the parameter list.  They will show up
90    in the documentation as model parameters, but they are never sent to
91    the kernel functions.  Note that *effect_radius* and *volfraction*
92    must occur first in structure factor calculations.
93
94    *category* is the default category for the model.  The category is
95    two level structure, with the form "group:section", indicating where
96    in the manual the model will be located.  Models are alphabetical
97    within their section.
98
99    *source* is the list of C-99 source files that must be joined to
100    create the OpenCL kernel functions.  The files defining the functions
101    need to be listed before the files which use the functions.
102
103    *ER* is a python function defining the effective radius.  If it is
104    not present, the effective radius is 0.
105
106    *VR* is a python function defining the volume ratio.  If it is not
107    present, the volume ratio is 1.
108
109    *form_volume*, *Iq*, *Iqxy*, *Imagnetic* are strings containing the
110    C source code for the body of the volume, Iq, and Iqxy functions
111    respectively.  These can also be defined in the last source file.
112
113    *Iq* and *Iqxy* also be instead be python functions defining the
114    kernel.  If they are marked as *Iq.vectorized = True* then the
115    kernel is passed the entire *q* vector at once, otherwise it is
116    passed values one *q* at a time.  The performance improvement of
117    this step is significant.
118
119    *demo* is a dictionary of parameter=value defining a set of
120    parameters to use by default when *compare* is called.  Any
121    parameter not set in *demo* gets the initial value from the
122    parameter list.  *demo* is mostly needed to set the default
123    polydispersity values for tests.
124
125An *model_info* dictionary is constructed from the kernel meta data and
126returned to the caller.
127
128The doc string at the start of the kernel module will be used to
129construct the model documentation web pages.  Embedded figures should
130appear in the subdirectory "img" beside the model definition, and tagged
131with the kernel module name to avoid collision with other models.  Some
132file systems are case-sensitive, so only use lower case characters for
133file names and extensions.
134
135
136The function :func:`make` loads the metadata from the module and returns
137the kernel source.  The function :func:`make_doc` extracts the doc string
138and adds the parameter table to the top.  The function :func:`model_sources`
139returns a list of files required by the model.
140
141Code follows the C99 standard with the following extensions and conditions::
142
143    M_PI_180 = pi/180
144    M_4PI_3 = 4pi/3
145    square(x) = x*x
146    cube(x) = x*x*x
147    sinc(x) = sin(x)/x, with sin(0)/0 -> 1
148    all double precision constants must include the decimal point
149    all double declarations may be converted to half, float, or long double
150    FLOAT_SIZE is the number of bytes in the converted variables
151"""
152from __future__ import print_function
153
154#TODO: determine which functions are useful outside of generate
155#__all__ = ["model_info", "make_doc", "make_source", "convert_type"]
156
157from os.path import abspath, dirname, join as joinpath, exists, getmtime
158import re
159import string
160import warnings
161
162import numpy as np
163
164from .modelinfo import Parameter
165from .custom import load_custom_kernel_module
166
167try:
168    from typing import Tuple, Sequence, Iterator
169    from .modelinfo import ModelInfo
170except ImportError:
171    pass
172
173TEMPLATE_ROOT = dirname(__file__)
174
175F16 = np.dtype('float16')
176F32 = np.dtype('float32')
177F64 = np.dtype('float64')
178try:  # CRUFT: older numpy does not support float128
179    F128 = np.dtype('float128')
180except TypeError:
181    F128 = None
182
183# Conversion from units defined in the parameter table for each model
184# to units displayed in the sphinx documentation.
185RST_UNITS = {
186    "Ang": "|Ang|",
187    "1/Ang": "|Ang^-1|",
188    "1/Ang^2": "|Ang^-2|",
189    "1e-6/Ang^2": "|1e-6Ang^-2|",
190    "degrees": "degree",
191    "1/cm": "|cm^-1|",
192    "Ang/cm": "|Ang*cm^-1|",
193    "g/cm3": "|g/cm^3|",
194    "mg/m2": "|mg/m^2|",
195    "": "None",
196    }
197
198# Headers for the parameters tables in th sphinx documentation
199PARTABLE_HEADERS = [
200    "Parameter",
201    "Description",
202    "Units",
203    "Default value",
204    ]
205
206# Minimum width for a default value (this is shorter than the column header
207# width, so will be ignored).
208PARTABLE_VALUE_WIDTH = 10
209
210# Documentation header for the module, giving the model name, its short
211# description and its parameter table.  The remainder of the doc comes
212# from the module docstring.
213DOC_HEADER = """.. _%(id)s:
214
215%(name)s
216=======================================================
217
218%(title)s
219
220%(parameters)s
221
222%(returns)s
223
224%(docs)s
225"""
226
227def format_units(units):
228    # type: (str) -> str
229    """
230    Convert units into ReStructured Text format.
231    """
232    return "string" if isinstance(units, list) else RST_UNITS.get(units, units)
233
234def make_partable(pars):
235    # type: (List[Parameter]) -> str
236    """
237    Generate the parameter table to include in the sphinx documentation.
238    """
239    column_widths = [
240        max(len(p.name) for p in pars),
241        max(len(p.description) for p in pars),
242        max(len(format_units(p.units)) for p in pars),
243        PARTABLE_VALUE_WIDTH,
244        ]
245    column_widths = [max(w, len(h))
246                     for w, h in zip(column_widths, PARTABLE_HEADERS)]
247
248    sep = " ".join("="*w for w in column_widths)
249    lines = [
250        sep,
251        " ".join("%-*s" % (w, h)
252                 for w, h in zip(column_widths, PARTABLE_HEADERS)),
253        sep,
254        ]
255    for p in pars:
256        lines.append(" ".join([
257            "%-*s" % (column_widths[0], p.name),
258            "%-*s" % (column_widths[1], p.description),
259            "%-*s" % (column_widths[2], format_units(p.units)),
260            "%*g" % (column_widths[3], p.default),
261            ]))
262    lines.append(sep)
263    return "\n".join(lines)
264
265def _search(search_path, filename):
266    # type: (List[str], str) -> str
267    """
268    Find *filename* in *search_path*.
269
270    Raises ValueError if file does not exist.
271    """
272    for path in search_path:
273        target = joinpath(path, filename)
274        if exists(target):
275            return target
276    raise ValueError("%r not found in %s" % (filename, search_path))
277
278
279def model_sources(model_info):
280    # type: (ModelInfo) -> List[str]
281    """
282    Return a list of the sources file paths for the module.
283    """
284    search_path = [dirname(model_info.filename),
285                   abspath(joinpath(dirname(__file__), 'models'))]
286    return [_search(search_path, f) for f in model_info.source]
287
288def timestamp(model_info):
289    # type: (ModelInfo) -> int
290    """
291    Return a timestamp for the model corresponding to the most recently
292    changed file or dependency.
293    """
294    source_files = (model_sources(model_info)
295                    + model_templates()
296                    + [model_info.filename])
297    newest = max(getmtime(f) for f in source_files)
298    return newest
299
300def convert_type(source, dtype):
301    # type: (str, np.dtype) -> str
302    """
303    Convert code from double precision to the desired type.
304
305    Floating point constants are tagged with 'f' for single precision or 'L'
306    for long double precision.
307    """
308    if dtype == F16:
309        fbytes = 2
310        source = _convert_type(source, "half", "f")
311    elif dtype == F32:
312        fbytes = 4
313        source = _convert_type(source, "float", "f")
314    elif dtype == F64:
315        fbytes = 8
316        # no need to convert if it is already double
317    elif dtype == F128:
318        fbytes = 16
319        source = _convert_type(source, "long double", "L")
320    else:
321        raise ValueError("Unexpected dtype in source conversion: %s"%dtype)
322    return ("#define FLOAT_SIZE %d\n"%fbytes)+source
323
324
325def _convert_type(source, type_name, constant_flag):
326    # type: (str, str, str) -> str
327    """
328    Replace 'double' with *type_name* in *source*, tagging floating point
329    constants with *constant_flag*.
330    """
331    # Convert double keyword to float/long double/half.
332    # Accept an 'n' # parameter for vector # values, where n is 2, 4, 8 or 16.
333    # Assume complex numbers are represented as cdouble which is typedef'd
334    # to double2.
335    source = re.sub(r'(^|[^a-zA-Z0-9_]c?)double(([248]|16)?($|[^a-zA-Z0-9_]))',
336                    r'\1%s\2'%type_name, source)
337    # Convert floating point constants to single by adding 'f' to the end,
338    # or long double with an 'L' suffix.  OS/X complains if you don't do this.
339    source = re.sub(r'[^a-zA-Z_](\d*[.]\d+|\d+[.]\d*)([eE][+-]?\d+)?',
340                    r'\g<0>%s'%constant_flag, source)
341    return source
342
343
344def kernel_name(model_info, is_2d):
345    # type: (ModelInfo, bool) -> str
346    """
347    Name of the exported kernel symbol.
348    """
349    return model_info.name + "_" + ("Iqxy" if is_2d else "Iq")
350
351
352def indent(s, depth):
353    # type: (str, int) -> str
354    """
355    Indent a string of text with *depth* additional spaces on each line.
356    """
357    spaces = " "*depth
358    sep = "\n" + spaces
359    return spaces + sep.join(s.split("\n"))
360
361
362_template_cache = {}  # type: Dict[str, Tuple[int, str, str]]
363def load_template(filename):
364    # type: (str) -> str
365    path = joinpath(TEMPLATE_ROOT, filename)
366    mtime = getmtime(path)
367    if filename not in _template_cache or mtime > _template_cache[filename][0]:
368        with open(path) as fid:
369            _template_cache[filename] = (mtime, fid.read(), path)
370    return _template_cache[filename][1]
371
372def model_templates():
373    # type: () -> List[str]
374    # TODO: fails DRY; templates are listed in two places.
375    # should instead have model_info contain a list of paths
376    return [joinpath(TEMPLATE_ROOT, filename)
377            for filename in ('kernel_header.c', 'kernel_iq.c')]
378
379
380_FN_TEMPLATE = """\
381double %(name)s(%(pars)s);
382double %(name)s(%(pars)s) {
383    %(body)s
384}
385
386
387"""
388
389def _gen_fn(name, pars, body):
390    # type: (str, List[Parameter], str) -> str
391    """
392    Generate a function given pars and body.
393
394    Returns the following string::
395
396         double fn(double a, double b, ...);
397         double fn(double a, double b, ...) {
398             ....
399         }
400    """
401    par_decl = ', '.join(p.as_function_argument() for p in pars) if pars else 'void'
402    return _FN_TEMPLATE % {'name': name, 'body': body, 'pars': par_decl}
403
404def _call_pars(prefix, pars):
405    # type: (str, List[Parameter]) -> List[str]
406    """
407    Return a list of *prefix.parameter* from parameter items.
408    """
409    return [p.as_call_reference(prefix) for p in pars]
410
411_IQXY_PATTERN = re.compile("^((inline|static) )? *(double )? *Iqxy *([(]|$)",
412                           flags=re.MULTILINE)
413def _have_Iqxy(sources):
414    # type: (List[str]) -> bool
415    """
416    Return true if any file defines Iqxy.
417
418    Note this is not a C parser, and so can be easily confused by
419    non-standard syntax.  Also, it will incorrectly identify the following
420    as having Iqxy::
421
422        /*
423        double Iqxy(qx, qy, ...) { ... fill this in later ... }
424        */
425
426    If you want to comment out an Iqxy function, use // on the front of the
427    line instead.
428    """
429    for code in sources:
430        if _IQXY_PATTERN.search(code):
431            return True
432    else:
433        return False
434
435def make_source(model_info):
436    # type: (ModelInfo) -> str
437    """
438    Generate the OpenCL/ctypes kernel from the module info.
439
440    Uses source files found in the given search path.  Returns None if this
441    is a pure python model, with no C source components.
442    """
443    if callable(model_info.Iq):
444        raise ValueError("can't compile python model")
445
446    # TODO: need something other than volume to indicate dispersion parameters
447    # No volume normalization despite having a volume parameter.
448    # Thickness is labelled a volume in order to trigger polydispersity.
449    # May want a separate dispersion flag, or perhaps a separate category for
450    # disperse, but not volume.  Volume parameters also use relative values
451    # for the distribution rather than the absolute values used by angular
452    # dispersion.  Need to be careful that necessary parameters are available
453    # for computing volume even if we allow non-disperse volume parameters.
454
455    partable = model_info.parameters
456
457    # Identify parameters for Iq, Iqxy, Iq_magnetic and form_volume.
458    # Note that scale and volume are not possible types.
459
460    # Load templates and user code
461    kernel_header = load_template('kernel_header.c')
462    kernel_code = load_template('kernel_iq.c')
463    user_code = [open(f).read() for f in model_sources(model_info)]
464
465    # Build initial sources
466    source = [kernel_header] + user_code
467
468    # Make parameters for q, qx, qy so that we can use them in declarations
469    q, qx, qy = [Parameter(name=v) for v in ('q', 'qx', 'qy')]
470    # Generate form_volume function, etc. from body only
471    if isinstance(model_info.form_volume, str):
472        pars = partable.form_volume_parameters
473        source.append(_gen_fn('form_volume', pars, model_info.form_volume))
474    if isinstance(model_info.Iq, str):
475        pars = [q] + partable.iq_parameters
476        source.append(_gen_fn('Iq', pars, model_info.Iq))
477    if isinstance(model_info.Iqxy, str):
478        pars = [qx, qy] + partable.iqxy_parameters
479        source.append(_gen_fn('Iqxy', pars, model_info.Iqxy))
480
481    # Define the parameter table
482    source.append("#define PARAMETER_TABLE \\")
483    source.append("\\\n".join(p.as_definition()
484                              for p in partable.kernel_parameters))
485
486    # Define the function calls
487    if partable.form_volume_parameters:
488        refs = _call_pars("_v.", partable.form_volume_parameters)
489        call_volume = "#define CALL_VOLUME(_v) form_volume(%s)" % (",".join(refs))
490    else:
491        # Model doesn't have volume.  We could make the kernel run a little
492        # faster by not using/transferring the volume normalizations, but
493        # the ifdef's reduce readability more than is worthwhile.
494        call_volume = "#define CALL_VOLUME(v) 1.0"
495    source.append(call_volume)
496
497    refs = ["_q[_i]"] + _call_pars("_v.", partable.iq_parameters)
498    call_iq = "#define CALL_IQ(_q,_i,_v) Iq(%s)" % (",".join(refs))
499    if _have_Iqxy(user_code):
500        # Call 2D model
501        refs = ["q[2*i]", "q[2*i+1]"] + _call_pars("_v.", partable.iqxy_parameters)
502        call_iqxy = "#define CALL_IQ(_q,_i,_v) Iqxy(%s)" % (",".join(refs))
503    else:
504        # Call 1D model with sqrt(qx^2 + qy^2)
505        warnings.warn("Creating Iqxy = Iq(sqrt(qx^2 + qy^2))")
506        # still defined:: refs = ["q[i]"] + _call_pars("v", iq_parameters)
507        pars_sqrt = ["sqrt(_q[2*_i]*_q[2*_i]+_q[2*_i+1]*_q[2*_i+1])"] + refs[1:]
508        call_iqxy = "#define CALL_IQ(_q,_i,_v) Iq(%s)" % (",".join(pars_sqrt))
509
510    # Fill in definitions for numbers of parameters
511    source.append("#define MAX_PD %s"%partable.max_pd)
512    source.append("#define NPARS %d"%partable.npars)
513
514    # TODO: allow mixed python/opencl kernels?
515
516    # define the Iq kernel
517    source.append("#define KERNEL_NAME %s_Iq"%model_info.name)
518    source.append(call_iq)
519    source.append(kernel_code)
520    source.append("#undef CALL_IQ")
521    source.append("#undef KERNEL_NAME")
522
523    # define the Iqxy kernel from the same source with different #defines
524    source.append("#define KERNEL_NAME %s_Iqxy"%model_info.name)
525    source.append(call_iqxy)
526    source.append(kernel_code)
527    source.append("#undef CALL_IQ")
528    source.append("#undef KERNEL_NAME")
529
530    return '\n'.join(source)
531
532def load_kernel_module(model_name):
533    # type: (str) -> module
534    if model_name.endswith('.py'):
535        kernel_module = load_custom_kernel_module(model_name)
536    else:
537        from sasmodels import models
538        __import__('sasmodels.models.'+model_name)
539        kernel_module = getattr(models, model_name, None)
540    return kernel_module
541
542
543
544section_marker = re.compile(r'\A(?P<first>[%s])(?P=first)*\Z'
545                            %re.escape(string.punctuation))
546def _convert_section_titles_to_boldface(lines):
547    # type: (Sequence[str]) -> Iterator[str]
548    """
549    Do the actual work of identifying and converting section headings.
550    """
551    prior = None
552    for line in lines:
553        if prior is None:
554            prior = line
555        elif section_marker.match(line):
556            if len(line) >= len(prior):
557                yield "".join(("**", prior, "**"))
558                prior = None
559            else:
560                yield prior
561                prior = line
562        else:
563            yield prior
564            prior = line
565    if prior is not None:
566        yield prior
567
568def convert_section_titles_to_boldface(s):
569    # type: (str) -> str
570    """
571    Use explicit bold-face rather than section headings so that the table of
572    contents is not polluted with section names from the model documentation.
573
574    Sections are identified as the title line followed by a line of punctuation
575    at least as long as the title line.
576    """
577    return "\n".join(_convert_section_titles_to_boldface(s.split('\n')))
578
579def make_doc(model_info):
580    # type: (ModelInfo) -> str
581    """
582    Return the documentation for the model.
583    """
584    Iq_units = "The returned value is scaled to units of |cm^-1| |sr^-1|, absolute scale."
585    Sq_units = "The returned value is a dimensionless structure factor, $S(q)$."
586    docs = convert_section_titles_to_boldface(model_info.docs)
587    subst = dict(id=model_info.id.replace('_', '-'),
588                 name=model_info.name,
589                 title=model_info.title,
590                 parameters=make_partable(model_info.parameters.kernel_parameters),
591                 returns=Sq_units if model_info.structure_factor else Iq_units,
592                 docs=docs)
593    return DOC_HEADER % subst
594
595
596def demo_time():
597    # type: () -> None
598    """
599    Show how long it takes to process a model.
600    """
601    import datetime
602    from .modelinfo import make_model_info
603    from .models import cylinder
604
605    tic = datetime.datetime.now()
606    make_source(make_model_info(cylinder))
607    toc = (datetime.datetime.now() - tic).total_seconds()
608    print("time: %g"%toc)
609
610def main():
611    # type: () -> None
612    """
613    Program which prints the source produced by the model.
614    """
615    import sys
616    from .modelinfo import make_model_info
617
618    if len(sys.argv) <= 1:
619        print("usage: python -m sasmodels.generate modelname")
620    else:
621        name = sys.argv[1]
622        kernel_module = load_kernel_module(name)
623        model_info = make_model_info(kernel_module)
624        source = make_source(model_info)
625        print(source)
626
627if __name__ == "__main__":
628    main()
Note: See TracBrowser for help on using the repository browser.