Improving Performance of World Wide Web by Adaptive Web Traffic Reduction
Improving Performance of World Wide Web by Adaptive Web Traffic Reduction
bandwidth dynamically by considering the correlation can operate on a client or server or on an intermediate system.
characteristics. In proxy caching, proxy server stores cacheable responses to
Cache stores cacheable requests and responses for handling the URL requests, subsequent requests for the same URLs
new requests. If a new request that is same as a stored request yields cache hit. The URL request from the client is forwarded
arrives, then cache can supply the stored response rather than through proxy servers. The proxy servers accept the URL
accessing the resource from the web server. The web caching request from the client and it checks in its own local cache for
and web cache sharing schemes are useful for latency local hit, if requested object is not available there, it forwards
reduction, bandwidth conservation and disconnected that request to the web server. In web cache sharing, for each
operation. If a group of clients are topologically close and URL request, the proxy first checks in its local cache, if there
under common administrative control, then the administrator is no local hit, then that proxy checks in other proxies cache
could install one or more proxy caches in front of the clients for remote hit. If there is no remote hit, then it forwards that
for forming a cluster to lower the client-perceived latency. The request to the web server. Upon receipt of that document, the
methodologies for client cluster identification are discussed in requested proxy server stores it in cache and returns the
[10]. The various web caching techniques, caching protocols document to the client. Due to dynamic documents, CPU
and caching architectures are discussed in [11] - [17]. To overhead, memory spending for caching and administrative
extend the effectiveness of HTTP caches, techniques such as overhead, web caching techniques are considered as
cooperative caching, pre-fetching, partial transfers, delta unsatisfactory techniques. But when the network bottleneck
encoding, cache based compaction and HTML macros are such as congestion is compared, these overhead are negligible.
developed [14, 15]. A detective browser is proposed in [18], it
can immediately determine whether the requested content is D. Web Server Pushing
dynamic or secured. If so, the browser will bypass the proxy Web server pushes some or all of the documents to some
and forward the request directly to the web server. place near the client site. When the client accesses a
document, it will go to a site that is nearer to the client [21].
III. WEB TRAFFIC REDUCTION TECHNIQUES This approach reduces the latency in accessing the documents.
Web traffic reduction techniques are necessary for accessing In this technique, it is difficult for a web content provider to
the web sites efficiently with existing network facility. It is know the proper place to push documents.
costlier to use infinite bandwidth in all organizations. Many
studies show that the web caching has the maximum limit of E. Browser Initiated Server Pushing (BIP)
cache hit ratio of 50%. But pre-fetching can improve the hit In BIP, upon receiving a HTTP request, the server actively
ratio to 60% or even more than 80% [19]. But pre-fetching pushes embedded contents if the permission is given by the
techniques increases the web traffic for pre-fetching the client [21]. By means of the pushing mechanism, the HTML
anticipated sites. Various techniques available for web traffic web page will be down loaded in one RTT if the embedded
reduction are given below. contents and the HTML web page are on the same server. This
approach reduces the download latency for web pages and to
A. Content Simplification improve web server resource utilization. In the browser
Web designers can use common sense to reduce page initiated server pushing, the server does not know what is in
complexity or special tools can be used to optimize image the client cache. Without considering the client cache
coding. But some data such as medical images, broadcast contents, the server pushes much more images than actually
quality videos and executable software cannot be simplified needed.
without loss of meaning. Optimization of HTML tags can be
done for reducing the size of the web page to be transmitted F. Load Balancing Techniques
[5]. But the content simplification and optimization techniques In this approach, if any server is over loaded, its jobs are
cannot be used for much reduction in web traffic. shared by other under loaded servers. Load balancing systems
monitor the health of the available servers and make decisions
B. Compression to route the traffic for optimizing the performance and
Redundant bits within a single transfer can be reduced using availability. This ensures that the users will be connected to
compression techniques. Existing general purpose the most available server, providing excellent and predictable
compression algorithms provide significant size reductions. quality of service to the end user [22]. Many traffic
Several compression techniques such as Huffman coding, LZ, management and load balancing techniques are discussed in
LZW, JPEG, MPEG, H.261, H.263 are available for text, [23]. An adaptive load balancing scheme for web servers is
image, audio and video compression [20]. Compression ratio discussed in [22]. An adaptive multi agent coordination
depends on the compression algorithm and size of the file to algorithm is proposed in [24] for performing distributed
be compressed. These techniques can reduce only temporal or dynamic load balancing. A load cluster management system
spatial redundancy. But it can’t reduce the frequent based on SNMP and web technology is discussed in [25]. A
transmission of the same file through the network. dynamic load balancing algorithm is proposed in [26] for
improving throughput. Even if several load balancing
C. Web Caching algorithms are available, it is difficult for distributing the work
A cache is a facility that stores cacheable requests and among the existing servers and balancing the work among
responses. Subsequent requests can be satisfied from the cache them.
instead of accessing the objects from web server. The cache
G. Intelligent Agents technique and combines the existing techniques such as web
Intelligent agent monitors user actions and dynamically caching, web cache sharing, pre-fetching, traffic dispersion,
takes decision to access web sites by analyzing the traffic on compression etc. Traffic reduction algorithms are developed
the internet. The intelligent agent monitors the web traffic and for combining these techniques. Web traffic reduction is
it reports the status of bandwidth usage to the browser [27]. achieved in adaptive manner by monitoring the user’s
The browser parses the HTML page and the agent makes preferences and bandwidth usage. In this technique, intelligent
prediction about future references. If the traffic reported by the agents monitor the bandwidth usage and select the best
intelligent agents are less than the threshold, then the browser techniques at the client side and server side to achieve
can pre-fetch the predicted references before it is actually maximum traffic reduction. Since it is a hybrid technique,
referenced, potentially resulting in much lower latencies. efficient bandwidth utilization and more web traffic reduction
Using intelligent agents delay in accessing the network, are achieved. Schematic block diagram of the functions of
bandwidth consumption and network bottlenecks can be intelligent agents at web server and browser side is shown in
reduced. Fig. 1.
V. ADAPTIVE WEB TRAFFIC REDUCTION TECHNIQUE B. Web Traffic Reduction at Server Side
In this new adaptive technique, intelligent agents are The intelligent agents at the server side negotiate with the
maintained at the client and server side for monitoring the web web browser about its capabilities. Depending upon the
traffic. Adaptive web traffic reduction technique is a hybrid capabilities of the browser, the intelligent agents take dynamic
decision about sending the documents in compressed form. (b) If the bandwidth usage is between threshold values
The proposed algorithm for web traffic reduction at server side say x1 and x2, send medium quality image, audio
is given below. and video files.
(c) If bandwidth usage is greater than a threshold
value say x3, send only low quality image, audio
and video files
2. Intelligent agents identify the large files and send it in
compressed form.
3. Intelligent agents separate the static pages from
dynamic pages and make the static pages as public to
cache it in browser side.
4. Intelligent agents identify congestion and reduce the
traffic in that route.
5. Intelligent agents convert the burst nature of the
traffic into sub bursts.
6. Intelligent agents optimize the HTML markup
generated by WYSIWYG programs. Only optimized
HTML is delivered to the browser
100
80
60
40
20
0
5 10 15 20 25 30
Time (minutes)
DPB IE Netscape
current network. Intelligent agents help the browser to monitor [3] Jude Shavlik, Susan Calcari, Tina Eliassi-Rad and jack Solock, “An
Instructable Adaptive Interface for Discovering and Monitoring
the bandwidth usage and adjust the pre-fetching of subsequent Information on the World-Wide Web”, Proceedings of International
links to make constant traffic and to avoid the network Conference on Intelligent user interfaces, ACM, 1999, pp.157-160.
bottleneck such as congestion. We have also observed that, by [4] Hiroshi Fujinoki, Kiran K Gollamudi, “Object Packaging – Web
incorporating the traffic reduction algorithms in the browser, Response Time Reduction for slow and busy web servers”, Proc. of the
27th Annual Conference on Local Computer Networks, IEEE Computer
latency is reduced at a range of 20 – 60 % and cache hit ratio
society, 2002.
increased 40 – 82 % as shown in Fig. 3 and 4. [5] Jacqueline Spiesser, Less Kitchen, “Optimization of HTML
Automatically generated by WYSIWYG Programs”, Proceedings of the
ACM SIGPLAN 2004 Haskell Workshop, 2004, pp.80-91.
100 [6] Dongjun Shin, Kern Koh, “Optimizing Web Content Delivery using
Web Server Accelerator”, 25th Australasian Computer Science
Cache Hit Ratio
80
Conference, 2002, pp.233-239.
60 [7] Fumio Ishizaki, “Study on reduction of total bandwidth requirement by
traffic dispersion”, In Proceedings of International conference on ATM,
40 IEEE, 2001, pp.285-289.
[8] Chris Drummond, Dan Ionescu, Robert Holte, “Intelligent Browsing for
20
Multimedia Applications”, Proceedings of Multimedia, IEEE, 1996, pp.
0 386-399.
1 3 5 7 9 11 13 15 17 19 21 23 25 [9] Yen-Wen Chen, “Experimental Study of Internet Traffic Modeling and
Bandwidth Allocation”, IEEE Pacific Rim Conference on
Time (Minutes)
Communications, Computers and Signal Processing, 2001, pp.587-590.
[10] Balachander Krishnamurthy, Jia Wang, “On Network – Aware
Adaptive Web Traffic Reduction Technique Web Caching Technique Clustering of Web Clients”, Proceedings of ACM SIGCOMM, 2000, pp.
Fig. 3 Analysis of Cache Hit Ratio by Adaptive Reduction Technique 97-108.
[11] Hossam Hassanein, Zhengang Liang and Patrick Martin, “Performance
and Web Caching Technique Comparison of Alternative Web Caching Techniques”, Procceedings of
the seventh International Symposium on Computers and
Communications, IEEE, 2002.
700 [12] C. Mala and J.S. Jayasudha, “Web Cache Sharing Techniques”,
Proceedings of 6th International Conference of High Performance
Time (Seconds)
600
500 Computing, Asia, 2002.
400 [13] M. Liu, F. Wang, D. Zeng, L.Yang, “An Overview of world wide Web
300 Caching”, International conference on Systems Man and Cybernetics,
200 IEEE, 2001, pp.3045-3050.
100 [14] Greg Barish and Katia, “World Wide Web Caching: Trends and
0 Techniques”, IEEE Communication, May 2000, pp. 178-185.
[15] Jeffrey C. Mogul, “Squeezing More Bits Out of HTTP Caches”, IEEE
1 3 5 7 9 11 13 15 17 19 21 23 25 Network, May/June 2000, pp. 6-12.
No. of Sites [16] Hykyung Bahn, “A Shared Cache Solution for the Home Internet
Gateway”, IEEE Transactions on Consumer Electronics, Vol.50, No.1,
Adaptive Web Traffic Reduction Technique Feb 2004, pp. 168-172.
Without Adaptive Reduction Technique [17] Li Xiao, Xiaodong Zhang, Artur Andrzejak, Songqing Chen, “Building a
Large and Efficient Hybrid Peer to Peer Internet Caching System”, IEEE
Fig. 4 Analysis of Latency by Adaptive Reduction Technique and Transactions on Knowledge and Data Engineering, Vol 16, No.6, June
Web Caching Technique 2004, pp.754-769.
[18] Songqing Chen and Xiaodong Zhang, “Detective Browsers : A Software
Technique to Improve Web Access Performance and Security”,
Proceedings of 7th International workshop on Web content caching and
VII. CONCLUSION Distribution (WCW’ 02), 2002.
The intelligent agents help the browser and web server to [19] Shi L, Gu Z, Wei L and Shi Y. “Popularity-based selective markov
model”. In Proceedings of the IEEE/WIC/ACM International
monitor the bandwidth usage and user’s preferences. In this Conference on Web Intelligence, 2004, pp. 504-507.
paper an adaptive technique is proposed for reducing web [20] Fred halsall, Mutimedia Communications, Pearson Education, 2002.
traffic and to access the web sites efficiently. The proposed [21] Wenting Tang, Matt W. Mutka, “Intelligent Browser Initiated Server
algorithms at client side and server side are efficient to reduce Pushing”, IEEE International Conference on Performance,
Computing, and Communications Conference, 2000, pp.17-23.
the web traffic in adaptive manner. The simulation results [22] James Aweya, M. Ouellette, D.Y Montuno, B. Doray, K. Felske, “An
show that this adaptive technique maintains almost constant Adaptive Load Balancing Scheme for web servers”, International
traffic and provides effective bandwidth usage. Since it is a Journal of Network Management, 2002, pp.3-39.
hybrid technique, latency is reduced to 20 – 60 % and cache [23] http://www.zenus.com/solutions/zlb/indexfull.html
[24] David Vengerov, Hamid R Berneji, Alex Vengerov, “An adaptive
hit ratio is increased 40 – 82 %. New technologies are to be coordination among fuzzy reinforcement learning agents performing
developed to reduce web traffic without increasing the cost distributed dynamic load balancing”, Proceedings of 11th IEEE
spent for networking. International Conference on Fuzzy Systems, 2002, pp.179-184.
[25] Myung-Sup kim, Mi-Jeong Choi, James W. Hong, “Highly available
and efficient load cluster management system using SNMP and web”,
REFERENCES Proceedings of Network Operations and Management Symposium,
[1] Jae-Won Choi and Kwang-Hui Lee, “A Web-based Management System IEEE, 2002, pp.619-632.
for Network Monitoring”, IEEE workshop on IP operations and [26] Hemant B. More, Jie Wu, “Throughput Improvement through Dynamic
management (IPOM 2002), 2002, pp. 98-102. Load Balance”, IEEE southeast con’94, 1994, pp.339-342.
[2] Zhen Sheng Guo, Yan Zhuang, “Improving Network Performance by [27] Jayasudha J.S, Achuthsankar S. Nair, “An Intelligent Browser for Web
Traffic Reduction”, International Conference on Information, Traffic and Latency Reduction”, Proceedings of International
Communication and Signal Processing, 1997, pp. 1226-1230.