====== Maintenance ====== Here are some tips to automate some of the day-to-day maintenance needed or recommended for DokuWiki. See also the plugins: [[plugin:cleanup|cleanup]] and [[plugin:clearhistory|clearhistory]] ===== Keep Blacklist up to date ===== See [[:blacklist]] on how to set up a cronjob to keep the Anti-Spam Blacklist current. ===== Automatic cleanup script ===== It is recommended to set up some cleanup process for busy DokuWikis. The following [[wp>Bash (Unix shell)]] shell script serves as an example. It deletes old revisions from the [[:attic]], removes stale lock files and empty directories, and it cleans up the [[:caching|cache]]((For a discussion of cache maintenance see also the [[https://forum.dokuwiki.org/post/22265|forum discussion]].)). #!/bin/bash cleanup() { local data_path="$1" # full path to data directory of wiki local retention_days="$2" # number of days after which old files are to be removed # purge files older than ${retention_days} days from attic and media_attic (old revisions) find "${data_path}"/{media_,}attic/ -type f -not -name _dummy -mtime +"${retention_days}" -delete # remove stale lock files (files which are 1-2 days old) find "${data_path}"/locks/ -name '*.lock' -type f -mtime +1 -delete # remove empty directories find "${data_path}"/{attic,cache,index,locks,media,media_attic,media_meta,meta,pages,tmp}/ \ -mindepth 1 -type d -empty -delete # remove files older than ${retention_days} days from the cache if test -n "$(find "${data_path}"/cache/?/ -maxdepth 1 -print -quit &> /dev/null)" then find "${data_path}"/cache/?/ -type f -not -name _dummy -mtime +"${retention_days}" -delete fi } # cleanup DokuWiki installations (path to datadir, number of days) # some examples: cleanup /home/user1/htdocs/doku/data 256 cleanup /home/user2/htdocs/mywiki/data 180 cleanup /var/www/superwiki/data 180 To run it automatically, set up a [[man>crontab(5)|cronjob]]. The following example calls the script every day 7 minutes after midnight. To run as non-root user remove ''root''. 7 0 * * * root /full/path/to/cleanup.sh Be sure to set everything up correctly - you don't want to delete the wrong things, do you? ==== Windows -- warmzip ==== A script for cleaning out old files on Windows systems is [[http://winadmin.tumblr.com/post/8005353779/warmzip-clean-up-folders-by-compressing-moving|waRmZip]], available from [[http://sourceforge.net/project/showfiles.php?group_id=88417|here on SourceForge]]. Write a batch file to call it, and schedule it to run every day. And as the man says: 'Be sure to set everything up correctly' ;-) I took the above suggestion to use ''waRmZip'' and wrote this batch file - maybe it will help out. My favorite way to run cron jobs on Windows is [[https://sourceforge.net/projects/pycron|PyCron]]. @echo off set waRmZip="c:\Program Files\waRmZip\waRmZip.wsf" set wikiHome="c:\path\to\htdocs\wiki\data" rem Move attic files older than 30 days to an archive location %waRmZip% %wikiHome%\attic /ma:30 /md:%wikiHome%_archive\attic /r /q rem Option: delete attic files older than 30 days rem %waRmZip% %wikiHome%\attic /da:30 /dc /r /q rem Delete empty attic directories; waRmZip requires the /da flag when using rem /df, so add filter for *.zzz so /da doesn't remove any files %waRmZip% %wikiHome%\attic /r /da:31 /df /fo:*.zzz /q rem Remove stale lock files %waRmZip% %wikiHome%\locks /da:1 /fo:*.lock /r /q rem Remove empty directories %waRmZip% %wikiHome%\pages /da:365 /df /fo:*.zzz /r /q ==== Windows -- batch script ==== This is another Windows command shell script for maintaining your dokuwiki base on a Windows environment. The script uses the free and open source utility find, which can be obtained via [[http://gnuwin32.sourceforge.net/]] All paths are read from the DokuWiki config file. Files to be deleted can be shown before deletion, to prevent accidental deletion of files. @echo off setlocal REM This script performs some basic DokuWiki maintenance REM Copyright (C) 2012 Peter Mosmans REM This program is free software: you can redistribute it and/or modify REM it under the terms of the GNU General Public License as published by REM the Free Software Foundation, either version 3 of the License, or REM (at your option) any later version. REM This program is distributed in the hope that it will be useful, REM but WITHOUT ANY WARRANTY; without even the implied warranty of REM MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the REM GNU General Public License for more details. REM You should have received a copy of the GNU General Public License REM along with this program. If not, see . REM Please contact support AT go-forward.net for questions and/or feedback REM Last modification: 02-05-2012 (Peter Mosmans) set NAME=maintain_dokuwiki set VERSION=0.13 REM path to the dokuwiki configuration file enclosed in double quotes set DOKUWIKICONFIG="\full\filename\of\your\dokuwiki\conf\local.php" REM preserve all files that are younger than DAYSTOKEEP days set DAYSTOKEEP=31 REM set to true if you want to show results and pause before deleting any files set SHOWRESULTSFIRST=true set FIND=c:\tools\find.exe set TEMPFILE=%TMP%\%NAME%.tmp REM see if all tools are present for %%i in (%FIND%) do ( if not exist %%i ( echo sorry, could not find %%i - exiting echo you can obtain the free GNU tools from gnuwin32.sourceforge.net exit /b ) ) REM see if the dokuwiki configuration file can be read if not exist %DOKUWIKICONFIG% ( echo sorry, could not find DokuWiki config at %DOKUWIKICONFIG% - exiting exit /b ) REM grab the correct paths from the configuration file for /f "usebackq delims=' tokens=2,4" %%i in (%DOKUWIKICONFIG%) do ( if /i "%%i"=="datadir" set DOCUMENTROOT=%%j if /i "%%i"=="olddir" set ATTICDIR=%%j if /i "%%i"=="cachedir" set CACHEDIR=%%j if /i "%%i"=="lockdir" set LOCKDIR=%%j ) if "%DOCUMENTROOT%" == "" ( echo sorry, could not find datadir variable in %DOKUWIKICONFIG%, exiting... exit /b ) REM use defaults if the paths are not specified if /i "%ATTICDIR%" == "" set ATTICDIR=%DOCUMENTROOT%/attic if /i "%LOCKDIR%" == "" set LOCKDIR=%DOCUMENTROOT%/lock if /i "%CACHEDIR%" == "" set CACHEDIR=%DOCUMENTROOT%/cache REM purge files older than DAYSTOKEEP days from the attic %FIND% "%ATTICDIR%" -type f -mtime +%DAYSTOKEEP% -print > %TEMPFILE% REM remove locks older than one day %FIND% "%LOCKDIR%" -name "*.lock" -type f -mtime +1 -print >> %TEMPFILE% REM remove cache files older than DAYSTOKEEP %FIND% "%CACHEDIR%" -type f -mtime +%DAYSTOKEEP% -print >> %TEMPFILE% REM show results, if any for /f "usebackq" %%i in (`%FIND% "%TMP%" -size +1 -name %NAME%.tmp`) do ( if /i "%SHOWRESULTSFIRST%"=="TRUE" ( echo files to be deleted: type %TEMPFILE% pause ) for /f "delims=#" %%i in (%TEMPFILE%) do del "%%i" ) REM clean up del /f /q %TEMPFILE% endlocal ===== Keeping Playground Clean ===== To keep the wiki's [[playground:Playground]] and other pages clean, use a cron job e.g. every 30 minutes, that restores Playground and other pages to their original content. Example: Restore Playground every 30 min: 0,30 * * * * cp -f /path/to/savedwiki/data/pages/playground/playground.txt /path/to/dokuwiki/data/pages/playground/ Example: Restore all pages in [[:namespace]] "wiki" every 30 min: 0,30 * * * * cp -rf /path/to/savedwiki/data/pages/wiki/ /path/to/dokuwiki/data/pages/wiki/ ==== Problems with CAPTCHA plugin ==== Using the CAPTCHA plugin and the recommended [[tips:maintenance#keeping_playground_clean|maintenance method]] to keep the playground clean, can result in the effect of being unable to edit the playground. When this occurs, the problem can be easily resolved by removing the related playground files in the meta folder with the next cronjob. Example: Deletes Playground metafiles every 30 min: 0,30 * * * * rm -f /path/to/dokuwiki/data/meta/playground/playground.* ===== When cronjob is not available ===== When your hosting doesn't allow to use cronjobs, consider using the [[plugin:cronojob|cronojob]] plugin instead. ===== Discussion ===== Could you please provide PHP versions of these scripts to use with the cronojob plugin? ---- > Regarding the above cleanup script which uses file modification time (mtime), wouldn't it be safer to use the timestamp in the filename to determine if a file in the attic should be deleted or not? On the one hand, I'd say it could be done but it's of course trickier to set up. For many installations it will be fine to use mtime. On the other hand, some might want to make sure they clean up old files no matter what (e.g. files left after a crash or critical PHP error). ---- Could someone add the appropriate line for [[https://forum.dokuwiki.org/post/22265|cache maintenance]] to the Windows waRmZip script? ---- Does the [[plugin:cleanup|cleanup Plugin]] handle all the above tasks? Would it be recommended over running these scripts? ---- This is example of php script to clean old cache files. useful when .sh is not available to run. ($expire_time*60*60*24 )) { // Now do something with the olders files... print "The file $Filename is older than $expire_time days \n"; // For example deleting files: // unlink($Filename); } } echo 'ran'; ?> use this at your own risk. --- [[user>goldseed|S.C. Yoo]] //2012/02/10 12:49// ---- Cheers, I'd like to add that it is a good idea to clean up orphaned meta data, don't you think? I do the following (in an R script): -list all files in the pages directory recursively -add a column 'pagename' to this list that countains the file name again but without the base directory -in pagename exchange '/' (or '\') with ':' and remove the file extension -do the same for the meta directory + exclude some additional files -remove all entries from the meta-list from which the page name is in the pages-list -delete all files left in the meta list Of course one could add a time constraint on it so that you don't use metadata immediately. Clemo //2016/09/23 sometime//