Page for information about making WWW pages
Foreword on web publishing
The World Wide Web is a multimedia hypertext information
retrieval system that sits on top of Internet.
Publishing on the Web is very different from older methods of publication. A Web publication is inherently a general, device-independent and program-independent document with structural markup.
The presentation of a document may vary greatly to allow viewing
the same document on a wide variety of devices, ranging from
small mobile phone screen to full-size movie screens.
The HTML language was designed to promote worldwide distribution of
documents in a device-independent form. HTML file consists of content
and it's structure all stored in one file in a standardized form.
HTML is far from being perfect for the purpose, but it has served well and
is suitable for a wide range of documents. It is easy to learn and easy to use.
Practically any computer literate people can put the documents onto the
Web in a few hours, after an initial education of a day or two.
One reason for putting up this web page is to provide people
a place to find the information on web publishing from one
place and learn the most necessary skills in few days.
Some people (mostrly from traditional publishing industries)
think that Web authors should decide the physical appearance
of documents like font, color, layout, and other presentation features.
For such reasons HTML is implemented usually with nonstandard extensions
(some of them though later standardized) to control the dicument
features like colors, fonts and font sizes.
This kind of way of making web pages breaks the whole basic idea
that HTML should be viewable with practically any kind of device.
Many advertised "HTML programing skills" are quite much bluff
and breaking the whole idea of HTML.
Journalists may say that presentation issues cannot be distinguished
from structure and content, so presentation must be designed for each
concrete publication and issue separately.
Layout will not lose its importance but it will take more and more
place on users' systems and the users have their own preferences on
layout and style (colors and margins and fonts and so on).
In such case layout by the author or by the publishing side will
not get through as wanted and an attempt to enforce it might fail miserably.
When the presentation fails,
the document will look like a mess and the user will therefore discard it.
If exact presentation always in the same look and layout is essential
to the document, it is usually be better to publish such
documents but by using other methods than HTML (for example
Adpbe Acrobat PDF format is very suitable format for publishing
such documents on the web).
Linking turns texts to hypertext, and hypertextuality is among the key
factors that make the World-Wide Web a web of interconnected things.
Using links on your Web pages, you can conveniently let your readers
find background information, technical details,
definitions of terms, etc., on your pages or somewhere else.
A link is a just a pointer or reference, it does not do anything by itself.
A URL is rather like a telephone number or a street address which just
tells how to reach a document on the web. The general rule proposed
for linking is that one may freely set up non-framed HREF links to the web sites of others, is a rather reassuring rule since it happens to comport well with common practice and with common sense.
Webmaster should be prepared for the possibility that members of the public may set up bookmarks to subpages, and that other HTML authors may set up links to subpages. Since this sort of bookmarking and linking can and will happen, the webmaster should be courteous to those visitors and HTML authors. The webmaster, upon moving a page, should have the courtesy to supply a "forwarding" page that lets the visitor know the new page URL.
It is very good idea to have linked on the pages recognized as links
so that users have no difficulty in noticing which word is a link.
It belongs to the very basic skills needed in Web browsing to be able to
recognize a link as displayed on one's browser and to follow a link.
Any attemt to hide the link or change the apprearance what tha user
had used to see on their browser will usually lead to more harms
to usability than good to the presentation. When you do not
show clear what is link and what is not, the
browsing becomes guesswork, trial and error.
Speed and usablity are also necessary factors to consider.
Most of the web is is too slow nowadays. The slow operation
comes from the available bandwidth to access the site,
amount of data to be transferred and how the material
is put to the pages. The bandwidth available is controlled
by the network connection your web server has to Internet
and the speed of the connection your client has and the
load of the core network in between. This speed hard
to control by the web master in any other way than selecting
good connection from a good networking company or using a good
web hotel service. Amount of transferred data and how it is
presented can be controlled by proper web design. You can reduce
the amount of transferred data by making good HTML code
(some HTML tools produce few times bigger HTML files than
what is needed to present the document), slecting right
file formats and conversion parameters for pictures and other
documents and avoiding using unnecessary large pictures.
Sometimes the loading speed seen by user is also affected by
the web browser rending speed and way they do it
(for example some versions of Netscape draw tables only when
it has get all it's cells completely which sometimes can take
lots of time). With good network connection, powerful
enough web server and good web design the web page accesses
can be made quick to operate.
When making a new large web site it is a good idea to design the site
structure well so that you don't have to redesign it many time over and over.
Usually this means designing the menus and directory structures
for the whole site. Whjen you get this ready don't hurry in putting
it online.Publishing just a site skeleton with menus and subpages without real
content in the end just frustrates the users who come by to your site.
It is better to first make a small working site with little material and
then enhance it later than publishing a large page structure without any
content. It is better to promise what is coming later on the
main oage or subpage than making links to pages which just say
"under construction" or "coming soon" or "page not found".
Users of the site just get frustrated because you seemed
to promise than your site has some interresting material by
putting the link visible, but failed to keepp that promise. Users
of the web pages are dissappointed when they encounter many this
kind of thing on your page and go to some other site and
propably never come back. The fact of the life is that
that most of the things promised to "come soon" usually don't
come anytime soon on-line if ever come on-line. A web site
with lots of pages with just "coming soon" in them is
just makes many dissapointed users who don't come back to
your site later to check out of your promises of new
material is true or not (because on most those promises
are not kept).
Here are some general rules on web publishing:
- 1. The pages yopu publish must serve a purpose. There's no point in having a page if it has nothing useful to offer.
- 2. Be economical with graphics. Many users have limited bandwidth and don't want to wait their pages to load for too long.
- 3. Always use the alt field with images so that people who can't or don't wnat to see the grpahcis can still use your page.
- 4. Use GIF or JPEG file formats appropriately for the application they are good for.
- 5. Keep in mind that readership is international. Always write the date in words because numerical formats are interpreted differently in different countries. Be aware of words which belong to a local dialect.
- 6. Keep your page browser-independent. The last thing you want to do is turn away potential readers merely because they have the "wrong" software.
- 7. Support graceful degradation. Graceful degradation as a means to provide for backward compatibility is built into the W3C's recommendations. Do not try to break it.
- 8. Write correct HTML. It works better with any browser than incorrect one with errors.
- 9. It is a good idea to use a footer which gives the document URL, and an email contact address. This practice leaves the reader with no doubt as to the authorship of the document.
- 10. Publish the web site only when it has content worth to show. If you do not have much ready now, make a small working site and enhance it when you get more material.
Web site performance review checklist:
- Dead link free Web site
- Compatible on all browsers (i.e. Netscape, Internet Explorer, Macintosh)
- HTML error free Web site
- Proper usage of Meta Tags throughout HTML and content of Web site
- Theme of site is accurately recognized using top keywords
- Spelling error free throughout site
- Fast load time on various connection speeds (i.e. 56K, ISDN, T1)
- Search Engine acceptance ready
- Easy to navigate and fun to use
- Nice look and feel
HTML Specifications
Introductory material on HTML
Tips on writing HTML documents
Browser specific extensions
Be careful if you use those browser specific extensions.
The Extensible Markup Language (XML) is a subset of SGML.
Its goal is to enable generic SGML to be served, received, and processed on the Web in the way that is now possible with HTML. XML has been designed for ease of implementation and for interoperability with both SGML and HTML.
An XML page looks something like an HTML page, but there the similarity ends. XML uses HTML-style tags not just to format documents, but also to identify the kinds of information used in documents, so that information can be reformatted for use in other documents and can also be used for information processing.
To put that another way, you can use XML to represent data portably.
XML is clearly the way of the future as it establishes a common base from which content can be expanded into other formats.
XSL is a technology related to XML. XSL provides a mechanism for formatting and transforming XML, either at the browser or on the server.
General information
XML Resource pages
- Developer Works - IBM's XML resource, with lots of valuable XML compatibility information and home to xCentral, the XML web search engine
- The Development Exchange's XML Zone - home to links for valuable XML news, information, and web sites
- The XML Cover Pages - a comprehensive online reference work for the Extensible Markup Language (XML) and its parent, the Standard Generalized Markup Language (SGML)
- xml.com - XML information site
- XML Magazine - Timely news and information for the XML industry, with features written by experts and insiders
- XML Resources - links to XML related documents
- XML World - XML article collection with a Software Guide, FAQ, References and more
Applications for XML
- MathML - mathematical markup language
- SEML - Semi-Extensible Markup Language is a new language similar to XHTML and WML that allows the serving of both WML or HTML from a single source document
- SMIL - syncronized multimedia integration language
- WML - wireless markup language, used in WAP system
- XHTML - HTML 4.01 defined according to XML notations
Using XML with your programs and applications
XML Tools
- Xalan - an XSLT processor for transforming XML documents into HTML, text, or other XML document types
- Microsoft XML Notepad - a simple prototyping application for HTML authors and developers that enables the rapid building and editing of small sets of XML-based data
- XT - XT is an implementation in Java of XSL Transformations.
SMIL information
SMIL is a XML based language for making multimedia presentations.
It enables simple authoring of TV-like multimedia presentations such as training courses on the Web. The SMIL language is an
easy-to-learn HTML-like language. Thus, SMIL presentations can be written using a simple text-editor. A SMIL
presentation can be composed of streaming audio, streaming video, images, text or any other media type.
General information
SMIL Browsers
Other XML software
Character set problems related to Finland
General guides
Usability design
Site accessability
Use of frames
Use of tables
Handling errors
Multimedia
Specific site design tips
Bad things to avoid
Useful sites for more information
Why to validate
Page validators
Link checkers
Frames allow you to divide the page into several rectangular areas and to display a separate document in each rectangle. Each of those rectangles is called a "frame".
Frames are very popular because they are one of the few ways to keep part of the page stationary while other parts change. Frames are also one of the most controversial uses of HTML, because of the way the frames concept was designed, and because many web framed web sites are poorly implemented.
Normal frames are used to divide the entire browser window (or a frame) to subwindows.
Inline frames appear inside the presentation of a document and allow embedding relatively small documents onto pages.
Several technologies are used to create customized Web pages, including the Perl language, Microsoft's Active Server Pages, the PHP hypertext preprocessor, Java, JavaScript and other methods.
Nowadays customizd web pages can be created in the web server or on
the fly on the client side. There are many different scripting technologies
for this kind of and they are each suitable for different applications.
Java is Sun's cross-platform, object-based programmign language
which can be used for wide variety of applications. In web environment
Java is most often used as applets which are included to webpages and
then run by the client web browser Java virtual machine. Java can also
be used to make server side web server extensions called servlets.
JavaScript is Netscape's cross-platform, object-based scripting language for client and server applications. JavaScript is most commonly used to add code to web page which is executed by the web browser in the client computer. JavaScript can also be used to make server side scripts on Netscape web server.
Even though the name is quite similar to to Sun's Java, the JavaScript is
completely different technology than Java.
General information
Javascript tutorials
Javascript references
Javascrit books
Example source code resources
Related technologies
- An Introduction To DHTML - JavaScript and HTML make a nice couple
- ECMAScript - based on several originating technologies, the most well-known being JavaScript (Netscape Communications) and JScript (Microsoft Corporation)
PHP is a simple scripting language to implement server side fuctionality
to web servers which support that scripting language.
Sctive Server Pages (ASP) is a scripting system included in
Microsoft web servers which makes it possible to include
server side scripts to web pages.
There is also an ASP port for Apache web server.
Adding some web server executable code to the web pages using
some suitable tools. Those pieces of code within the page
are executed and the output of those scripts plus the original
HTML (without those executable codes) are sent to the web browser.
Web server software
- Apache - propably the best and most widely used free web server program with good documentation available on-line
- Jigsaw - The W3C's Server - Jigsaw is a Web server platform, providing a sample HTTP 1.1 implementation and a variety of other features on top of an architecture implemented in Java.
Using Apache software
Using other web server software
PERL is a fairly straightforward, widely known and well-respected scripting language. It is used for a variety of tasks, but in the context of web development it is used mainly to develop CGI scripts.
Python is a powerful, modern, free, open source, general purpose,
interpreted programming language developed by Guido van Rossum.
It is used by many programmers around the world. Python is easy to learn
and allows very rapid developments. Pythin canbe used as scripting language
for example for writing CGI-scripts or it can be embedded to HTML
page using Zope system.
CGI-scripts are executable programs separate programs which are run by the
web server for generating a web page on the fly (dynamic web pages, web searches)
or for processing the information sen by the client (usually information
from form). CGI-scripts can be written using practically any programming
laguage and using many scripting languages. Propably the most commonly
used language for making CGI scripts is Perl but there are many other
possibilities also for this.
A lot of people have Web pages but most feel that CGI scripts are
"over their head", but you can get started quite easily with many
free CGI scripts if your web server you are using supports them.
General information
Information on developing CGI scripts
CGI scripts and script collections
Solving problems
Display properties
Pictures
- Free Web Templates.com - free templates, free wallpapers, free textures, free buttons
- Pagekits - website templates, with all the code and images you need to get started in making your own web site
Video and web
Audio and web
Multimedia player plug-ins
General web legal topics
Linking
Processing and converting material
- Arachnophilia - Powerful Web Site Workshop with many options
- Converting UNIX to DOS and DOS to UNIX
- gohtml.com - free on-line service which converts many document formats to HTML
- HTMLDOC - Convert HTML files to PDF or PostScript
- HTML Tidy - processes hard to read HTML files to simpler and easier to manipulate manually
- JTidy - JTidy is a Java port of HTML Tidy, a HTML syntax checker and pretty printer. Like its non-Java cousin, JTidy can be used as a tool for cleaning up malformed and faulty HTML. In addition, JTidy provides a DOM interface to the document that is being processed, which effectively makes you able to use JTidy as a DOM parser for real-world HTML.
- Weblint - free HTML validation program written in Perl
- Website META Language (WML) - off-line HTML generation toolkit for Unix
- wvWare - wv is a library which allows access to Microsoft Word files. It can load and parse the word 2000, 97, 95 and 6 file formats. wv allows other programs access to Word documents for the purpose of converting them to other formats (for example HTML).
Web page upload and download
- Expect FAQ - FAQ on a very useful tool program for automating interctive programs like FTP
- GNU Wget - free utility for retrieving files from the internet using HTTP and FTP
- Gnu Wget - a freely available network utility to retrieve files using HTTP and FTP
- Lynx - most popular text only browser, useful in scripting and HTML to text conversions
- WebReaper - web crawler or spider, which can work its way through a website, downloading pages, pictures and objects that it finds so that they can be viewed locally, without needing to be connected to the internet
Off-line browsing programs
On-line tools
- WebCab.de: Fetch Page - tool for getting web pages from server and seeing debug information, this tool can imitate many browsers and even WAP
Scripting languages
WWW based discussion boards
- Discus - free discussion board script
- Ezboard - online community service
- Phorum - Phorum is a web based discussion software written in PHP. Unlike some popular web boards, Phorum utilizes a database to manage its messages.
- WWWBoard - free discussion group making script used in many sites
Converting HTML to other formats
- HTMLDOC - Convert HTML files to PDF or PostScript
- html2ps - HTML to PostScript converter that gives you nice printouts of HTML pages than many web browser's build-in printing functions
Adobe Acrobat (pdf) file generators
- HTMLDOC - Convert HTML files to PDF or PostScript
- FreePDF - Free PDF is a utility which allows you to create PDFs just by printing to a "Create PDF" printer, from any Windows 9x/ME application FOR FREE!
Other utilities
- Apache - freely available good web server software for UNIX, widely used
- Microsoft Internet Information Server - web server for Windows NT
- Netscape Web Server products - available for many platforms
Browser programs
- BROWSERS.COM - download new releases of latest versions of popular browsers
- Lynx - most popular text only browser
- Netscape Navigator - the most popular web browser
- Microsoft Internet Explorer
- Mozilla - web browser software with XML and CSS Level 2 support
- Opera - small and fast web browser
Plug-ins
Web material filters
- The Internet Junkbuster Proxy - blocks unwanted banner ads and protects your privacy from cookies and other threats
- WebWasher - filter program for Web pages to avoid unwanted content and for faster loading times
Other useful utilities
- Web Secretary - detects changes based on content analysis and can email the changed page to you with the new contents highlighted!
- WebStripper - makes it easy to copy websites to your hard disk for you to browse offline later, free advertising supported software
A Web cache sits between Web servers (or origin servers) and a client
and watches requests for HTML pages, images and files come by, saving a copy for itself. Then, if there is another request for the same object, it will use the copy that it has, instead of asking the origin server for it again.
This is used to reduce latency and to reduce traffic.
General information on search engines
Information on successfully adding web pages to search engines
Controlling search engines and robots
Search engines you can add to your site
- Atomz.com - site search service to quickly add a powerful search engine to your Web site with no hardware or software to install
- FreeFind.com - free search engine to add to a site
- ht://Dig Internet Search Engines - world wide web indexing and searching system for a small domain or intranet available under GNU General Public License
- Isearch - Isearch is an open source software package for indexing and searching text documents. It supports full text and field based searching, relevance ranked results and Boolean queries, and it can index many kinds of documents, including HTML, mail folders, list digests, and text with SGML-style mark up. Isearch also includes CGI tools to provide a web interface for searching.
- Perlfect Search - customizable and effective site indexing/searching suite available under an open source licence
- Sitelevel - free site search service, includes banner advertising
- WebGlimpse - powerful indexing and query system for UNIX that allows you to search through all your files very quickly
- Matt's Script Archive - includes also search scripts
- Using SWISH-E To Index Your Site - information on a fast, powerful, flexible, free, and easy to use system for indexing collections of Web pages or other text files
HTTP has been in use by the World Wide Web since 1990 and its use has increased steadily over the years, mainly because it has proven useful as a generic middleware protocol.
The following web sites provide free web hosting services.
Usually the free web hosting services will add advertisements
to the pages in them. For more details check what those service
providers have to offer.
www.epanorama.net <webmaster@epanorama.net>
Send feedback about this page
Recommend this page to a friend
Back to home page