{"cells": [{"cell_type": "markdown", "metadata": {}, "source": ["# Projects\n", "\n", "Wells are one of the fundamental objects in welly.\n", "\n", "Well objects include collections of Curve objects. Multiple Well objects can be stored in a Project.\n", "\n", "On this page, we take a closer look at the `Project` class. It lets us handle groups of wells. It is really just a list of `Well` objects, with a few extra powers.\n", "\n", "First, some preliminaries\u2026"]}, {"cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [{"data": {"text/plain": ["'0.5.1.dev15+gbf10f3b.d20220223'"]}, "execution_count": 1, "metadata": {}, "output_type": "execute_result"}], "source": ["import welly\n", "\n", "welly.__version__"]}, {"cell_type": "markdown", "metadata": {}, "source": ["---\n", "\n", "## Make a project\n", "\n", "We have a few LAS files in a folder; we can load them all at once with standard POSIX file globbing syntax:"]}, {"cell_type": "code", "execution_count": 2, "metadata": {}, "outputs": [{"name": "stderr", "output_type": "stream", "text": ["2it [00:00, 19.00it/s]\n"]}], "source": ["p = welly.read_las(\"../../tests/assets/example_*.las\")"]}, {"cell_type": "markdown", "metadata": {}, "source": ["Now we have a project, containing two files:"]}, {"cell_type": "code", "execution_count": 3, "metadata": {}, "outputs": [{"data": {"text/html": ["
Index | UWI | Data | Curves
0 | 3-2-B | 12 curves | SP, SN, ILD, LLS, LLD, MLL, NPHI, RHOZ, CAL1, GRC, DTP, CAL2
1 | 3-2-A | 12 curves | SP, SN, ILD, LLS, LLD, MLL, NPHI, RHOB, CAL1, GR, DT, CAL2
"], "text/plain": ["Project(2 wells: 3-2-B, 3-2-A)"]}, "execution_count": 3, "metadata": {}, "output_type": "execute_result"}], "source": ["p"]}, {"cell_type": "markdown", "metadata": {}, "source": ["You can pass in a list of files or URLs:"]}, {"cell_type": "code", "execution_count": 4, "metadata": {}, "outputs": [{"name": "stderr", "output_type": "stream", "text": ["0it [00:00, ?it/s]Only engine='normal' can read wrapped files\n", "3it [00:06, 2.09s/it]\n"]}], "source": ["p = welly.read_las(['../../tests/assets/P-129_out.LAS',\n", " 'https://geocomp.s3.amazonaws.com/data/P-130.LAS',\n", " 'https://geocomp.s3.amazonaws.com/data/R-39.las',\n", " ])"]}, {"cell_type": "markdown", "metadata": {}, "source": ["This project has three wells:"]}, {"cell_type": "code", "execution_count": 5, "metadata": {}, "outputs": [{"data": {"text/html": ["
Index | UWI | Data | Curves
0 | Long = 63* 45'24.460 W | 24 curves | CALI, HCAL, PEF, DT, DTS, DPHI_SAN, DPHI_LIM, DPHI_DOL, NPHI_SAN, NPHI_LIM, NPHI_DOL, RLA5, RLA3, RLA4, RLA1, RLA2, RXOZ, RXO_HRLT, RT_HRLT, RM_HRLT, DRHO, RHOB, GR, SP
1 | 100/N14A/11E05 | 18 curves | CALI, DT, NPHI_SAN, NPHI_LIM, NPHI_DOL, DPHI_LIM, DPHI_SAN, DPHI_DOL, M2R9, M2R6, M2R3, M2R2, M2R1, GR, SP, PEF, DRHO, RHOB
2 | 303N764340060300 | 22 curves | BS, CALI, CHR1, CHR2, CHRP, CHRS, DRHO, DT1R, DT2, DT2R, DT4P, DT4S, GR, HD1, HD2, HD3, NPOR, PEF, RHOB, SPR1, TENS, VPVS
"], "text/plain": ["Project(3 wells: Long = 63* 45'24.460 W, 100/N14A/11E05, 303N764340060300)"]}, "execution_count": 5, "metadata": {}, "output_type": "execute_result"}], "source": ["p"]}, {"cell_type": "markdown", "metadata": {}, "source": ["Typical, the UWIs are a disaster. Let's ignore this for now.\n", "\n", "The `Project` is really just a list-like thing, so you can index into it to get at a single well. Each well is represented by a `welly.Well` object."]}, {"cell_type": "code", "execution_count": 6, "metadata": {}, "outputs": [{"data": {"text/html": ["
Kennetcook #2
Long = 63* 45'24.460 W
crs | CRS({})
location | Lat = 45* 12' 34.237\" N
country | CA
province | Nova Scotia
latitude |
longitude |
datum |
section | 45.20 Deg N
range | PD 176
township | 63.75 Deg W
ekb | 94.8
egl | 90.3
gl | 90.3
tdd | 1935.0
tdl | 1935.0
td | None
data | CALI, DPHI_DOL, DPHI_LIM, DPHI_SAN, DRHO, DT, DTS, GR, HCAL, NPHI_DOL, NPHI_LIM, NPHI_SAN, PEF, RHOB, RLA1, RLA2, RLA3, RLA4, RLA5, RM_HRLT, RT_HRLT, RXOZ, RXO_HRLT, SP
"], "text/plain": ["Well(uwi: 'Long = 63* 45'24.460 W', name: 'Kennetcook #2', 24 curves: ['CALI', 'HCAL', 'PEF', 'DT', 'DTS', 'DPHI_SAN', 'DPHI_LIM', 'DPHI_DOL', 'NPHI_SAN', 'NPHI_LIM', 'NPHI_DOL', 'RLA5', 'RLA3', 'RLA4', 'RLA1', 'RLA2', 'RXOZ', 'RXO_HRLT', 'RT_HRLT', 'RM_HRLT', 'DRHO', 'RHOB', 'GR', 'SP'])"]}, "execution_count": 6, "metadata": {}, "output_type": "execute_result"}], "source": ["p[0]"]}, {"cell_type": "markdown", "metadata": {}, "source": ["Some of the fields of this LAS file are messed up; see the [Well notebook](Wells.ipynb) for more on how to fix this."]}, {"cell_type": "markdown", "metadata": {}, "source": ["## Plot curves from several wells\n", "\n", "The DT log is called DT4P in one of the wells. We can deal with this sort of issue with aliases. Let's set up an alias dictionary, then plot the DT log from each well:"]}, {"cell_type": "code", "execution_count": 7, "metadata": {}, "outputs": [], "source": ["alias = {'Sonic': ['DT', 'DT4P'],\n", "         'Caliper': ['HCAL', 'CALI'],\n", "         }"]}, {"cell_type": "code", "execution_count": 10, "metadata": {}, "outputs": [{"data": {"image/png": "\n", "text/plain": ["
"]}, "metadata": {"needs_background": "light"}, "output_type": "display_data"}], "source": ["import matplotlib.pyplot as plt\n", "\n", "# One track per well, all sharing the same depth axis.\n", "fig, axs = plt.subplots(figsize=(7, 14),\n", "                        ncols=len(p),\n", "                        sharey=True,\n", "                        )\n", "\n", "for ax, w in zip(axs, p):\n", "    log = w.get_curve('Sonic', alias=alias)\n", "    if log is not None:\n", "        ax = log.plot(ax=ax)\n", "    ax.set_title(\"Sonic log for\\n{}\".format(w.uwi))\n", "\n", "min_z, max_z = p.basis_range\n", "\n", "# Depth increases downwards, so reverse the y-axis limits.\n", "plt.ylim(max_z, min_z)\n", "plt.show()"]}, {"cell_type": "markdown", "metadata": {}, "source": ["## Get a `pandas.DataFrame`\n", "\n", "The `df()` method makes a DataFrame with a two-level index of UWI and depth.\n", "\n", "Before we export our wells, let's give Kennetcook #2 a better UWI:"]}, {"cell_type": "code", "execution_count": 11, "metadata": {"scrolled": false}, "outputs": [{"data": {"text/html": ["
Kennetcook #2
Kennetcook #2
crs | CRS({})
location | Lat = 45* 12' 34.237\" N
country | CA
province | Nova Scotia
latitude |
longitude |
datum |
section | 45.20 Deg N
range | PD 176
township | 63.75 Deg W
ekb | 94.8
egl | 90.3
gl | 90.3
tdd | 1935.0
tdl | 1935.0
td | None
data | CALI, DPHI_DOL, DPHI_LIM, DPHI_SAN, DRHO, DT, DTS, GR, HCAL, NPHI_DOL, NPHI_LIM, NPHI_SAN, PEF, RHOB, RLA1, RLA2, RLA3, RLA4, RLA5, RM_HRLT, RT_HRLT, RXOZ, RXO_HRLT, SP
"], "text/plain": ["Well(uwi: 'Kennetcook #2', name: 'Kennetcook #2', 24 curves: ['CALI', 'HCAL', 'PEF', 'DT', 'DTS', 'DPHI_SAN', 'DPHI_LIM', 'DPHI_DOL', 'NPHI_SAN', 'NPHI_LIM', 'NPHI_DOL', 'RLA5', 'RLA3', 'RLA4', 'RLA1', 'RLA2', 'RXOZ', 'RXO_HRLT', 'RT_HRLT', 'RM_HRLT', 'DRHO', 'RHOB', 'GR', 'SP'])"]}, "execution_count": 11, "metadata": {}, "output_type": "execute_result"}], "source": ["p[0].uwi = p[0].name\n", "p[0]"]}, {"cell_type": "markdown", "metadata": {}, "source": ["That's better.\n", "\n", "When creating the DataFrame, you can pass a list of the keys (mnemonics) you want, and use aliases as usual."]}, {"cell_type": "code", "execution_count": 12, "metadata": {}, "outputs": [{"data": {"text/plain": ["{'Sonic': ['DT', 'DT4P'], 'Caliper': ['HCAL', 'CALI']}"]}, "execution_count": 12, "metadata": {}, "output_type": "execute_result"}], "source": ["alias"]}, {"cell_type": "code", "execution_count": 16, "metadata": {}, "outputs": [{"data": {"text/html": ["
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
UWI | DEPT | Caliper | GR | Sonic
Kennetcook #2 | 1.0668000000 | 4.3912849426 | 46.6986503600 | NaN
Kennetcook #2 | 1.2192000000 | 4.3912849426 | 46.6986503600 | NaN
Kennetcook #2 | 1.3716000000 | 4.3912849426 | 46.6986503600 | NaN
Kennetcook #2 | 1.5240000000 | 4.3912849426 | 46.6986503600 | NaN
Kennetcook #2 | 1.6764000000 | 4.3912849426 | 46.6986503600 | NaN
... | ... | ... | ... | ...
303N764340060300 | 3387.5471999996 | 303.7090000000 | 32.0276000039 | 252.4951
303N764340060300 | 3387.6995999996 | 303.7090000000 | 32.0276000000 | 252.4951
303N764340060300 | 3387.8519999996 | 303.7090000000 | 32.0276000000 | 252.4951
303N764340060300 | 3388.0043999996 | 303.7090000000 | 32.0276000000 | 252.4951
303N764340060300 | 3388.1567999996 | 303.7090000000 | 32.0276000000 | 252.4951
\n", "

46594 rows \u00d7 3 columns

\n", "
"], "text/plain": [" Caliper GR Sonic\n", "UWI DEPT \n", "Kennetcook #2 1.0668000000 4.3912849426 46.6986503600 NaN\n", " 1.2192000000 4.3912849426 46.6986503600 NaN\n", " 1.3716000000 4.3912849426 46.6986503600 NaN\n", " 1.5240000000 4.3912849426 46.6986503600 NaN\n", " 1.6764000000 4.3912849426 46.6986503600 NaN\n", "... ... ... ...\n", "303N764340060300 3387.5471999996 303.7090000000 32.0276000039 252.4951\n", " 3387.6995999996 303.7090000000 32.0276000000 252.4951\n", " 3387.8519999996 303.7090000000 32.0276000000 252.4951\n", " 3388.0043999996 303.7090000000 32.0276000000 252.4951\n", " 3388.1567999996 303.7090000000 32.0276000000 252.4951\n", "\n", "[46594 rows x 3 columns]"]}, "execution_count": 16, "metadata": {}, "output_type": "execute_result"}], "source": ["keys = ['Caliper', 'GR', 'Sonic']\n", "\n", "df = p.df(keys=keys, alias=alias, rename_aliased=True)\n", "df"]}, {"cell_type": "markdown", "metadata": {}, "source": ["## Quality\n", "\n", "Welly can run quality tests on the curves in your project. Some of the tests take arguments. You can test for things like this:\n", "\n", "- `all_positive`: Passes if all the values are greater than zero.\n", "- `all_above(50)`: Passes if all the values are greater than 50.\n", "- `mean_below(100)`: Passes if the mean of the log is less than 100.\n", "- `no_nans`: Passes if there are no NaNs in the log.\n", "- `no_flat`: Passes if there are no sections of well log with the same values (e.g. because a gap was interpolated across with a constant value).\n", "- `no_monotonic`: Passes if there are no monotonic ramps in the log (e.g. because a gap was linearly interpolated across).\n", "\n", "Put lists of tests into a dictionary. The keys determine which logs the tests run against, for example:\n", "\n", "- `'GR'`: The test(s) will run against the GR log.\n", "- `'Gamma'`: The test(s) will run against whichever log matches `'Gamma'` in the alias dictionary.\n", "- `'Each'`: The test(s) will run against *every log* in a well.\n", "- `'All'`: Some tests take multiple logs as input, for example `quality.no_similarities`. These test(s) will run against all the logs as a group. This could be quite slow, because there may be a lot of pairwise comparisons to do.\n", "\n", "The tests are run against all wells in the project. If you only want to run against a subset of the wells, make a new project for them."]}, {"cell_type": "code", "execution_count": 11, "metadata": {}, "outputs": [], "source": ["import welly.quality as q\n", "\n", "tests = {\n", "    'All': [q.no_similarities],\n", "    'Each': [q.no_gaps, q.no_monotonic, q.no_flat],\n", "    'GR': [q.all_positive],\n", "    'Sonic': [q.all_positive, q.all_between(50, 200)],\n", "}"]}, {"cell_type": "markdown", "metadata": {}, "source": ["Let's add our own test for units:"]}, {"cell_type": "code", "execution_count": 12, "metadata": {}, "outputs": [], "source": ["def has_si_units(curve):\n", "    # Case-insensitive check against the unit strings we expect.\n", "    return curve.units.lower() in ['mm', 'gapi', 'us/m', 'k/m3']\n", "\n", "tests['Each'].append(has_si_units)"]}, {"cell_type": "markdown", "metadata": {}, "source": ["We'll use the same alias dictionary as before:"]}, {"cell_type": "code", "execution_count": 13, "metadata": {}, "outputs": [{"data": {"text/plain": ["{'Sonic': ['DT', 'DT4P'], 'Caliper': ['HCAL', 'CALI']}"]}, "execution_count": 13, "metadata": {}, "output_type": "execute_result"}], "source": ["alias"]}, {"cell_type": "markdown", "metadata": {}, "source": ["Now we can run the tests and look at the results, which are in an HTML table:"]}, {"cell_type": "code", "execution_count": 14, "metadata": {}, "outputs": [{"data": {"text/html": ["
Idx | UWI | Data | Passing | Caliper* | GR | Sonic* | SP | RHOB
 | | | % | 3/3 wells | 3/3 wells | 3/3 wells | 2/3 wells | 3/3 wells
0 | Kennetcook #2 | 5/24 curves | 54 | HCAL 4.39 in | GR 78.99 gAPI | DT 63.08 us/ft | SP 52.47 mV | RHOB 2.61 g/cm3
1 | 100/N14A/11E05 | 5/18 curves | 79 | CALI 8.90 in | GR 103.74 gAPI | DT 74.90 us/ft | SP 101.60 mV | RHOB 2.62 g/cm3
2 | 303N764340060300 | 4/22 curves | 78 | CALI 311.97 MM | GR 67.49 GAPI | DT4P 279.84 US/M | | RHOB 2493.56 K/M3
"], "text/plain": [""]}, "execution_count": 14, "metadata": {}, "output_type": "execute_result"}], "source": ["from IPython.display import HTML\n", "\n", "HTML(p.curve_table_html(keys=['Caliper', 'GR', 'Sonic', 'SP', 'RHOB'],\n", " tests=tests, alias=alias)\n", " )"]}, {"cell_type": "markdown", "metadata": {}, "source": ["Here's how to interpret the result:\n", "\n", "- Green background: the log is present. You can see the mean value and the units (check them!!).\n", "- Grey background: the log is not present.\n", "\n", "And the traffic light dots (hover to see how many tests passed): \n", "\n", "- Green dot: all the tests passed.\n", "- Orange dot: some tests failed.\n", "- Red dot: all tests failed.\n", "- Grey dot: no tests ran.\n", "\n", "The **Passing** percentage shows how many tests passed for that well."]}, {"cell_type": "markdown", "metadata": {}, "source": ["---\n", "\n", "© 2022 Agile Scientific, CC BY"]}], "metadata": {"kernelspec": {"display_name": "Python 3 (ipykernel)", "language": "python", "name": "python3"}, "language_info": {"codemirror_mode": {"name": "ipython", "version": 3}, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.9.7"}}, "nbformat": 4, "nbformat_minor": 2}