# I. INTRODUCTION flat file is also called plain text file it is semi structured file. A flat file consisted of data which is separated with commas, white spaces, tabs, tube (|) any many other characters. Al-Dubaee et al. ( 2010) stated that a flat file is a plain text or mixed text and binary file which usually contains one record per line or physical record (example on disc or tape). Within such a record, the single fields can be separated by delimiters, e.g. commas, or have a fixed length. In the latter case, padding may be needed to achieve this length. Extra formatting may be needed to avoid delimiter collision. There are no structural relationships between the records. Mathew (2005) stated that a flat file database should consist of nothing but data and, if records vary in length, delimiters. More broadly, the term refers to any database which exists in a single file in the form of rows and columns, with no relationships or links between records and fields except the table structure. Pratnortis (2005) stated that the advantage of a flat file is that it takes up less space than a structured file. However, it requires the application to have knowledge of how the data is organized within the file. By using Structure Query language and a database (rather than a collection of files in a file system), a user or an application is free from having to understand the location and layout of data (for example, the length of each item of data, its type of data, and its relationship to other data items). Another form of flat file is one in which table data is gathered in lines of ASCII text with the value from each table cell separated by a comma and each row represented with a new line. This type of flat file is also known as a comma-separated values file. # II. DIFFERENT TERMS RELATED TO RESEARCH a) Flat file Flat file is a single lined plain text file containing delimiters comma, tabs, tube (|) and many other characters. It is also called mixed text file. The main advantage of the flat file is that it takes less space as compare to the structured file. # b) Data warehouse Data ware house is a large data base which contains historical data in the form of cubes (Aggregates) and decision maker use this data for future decisions. Author ? : Muhammad Inayat ullah, University of Engineering & Technology, Peshawar, pakistan Telephone : +923339977807, E-mail : inayatbinzeb@gmail.com Author ? : Zeeshan, ICMS, Hayatabad phase-V, Peshawar, Pakistan. Telephone: +923139300775 E-mail : zeese2010@gmail.com Author ? : Mahwish Kundi,Gomal University,Dera Ismail Khan, Pakistan. Telephone: +923459774168 E-mail : kundikhan84@gmail.com III. # STEPS FOR SOLVING THE PROBLEM The main steps in conducting the research work are given as under: 1. Read and review the research paper related to the data warehousing, flat file (Semi structured) data. 2. Review the research critically. 3. Design the effective algorithm to transform the flat file (semi structured) Data into structured data. 4. Implement the algorithm and check output. # IV. SOLUTION TO THE PROBLEM AND METHODOLOGY Steps for the solution to mentioned problem are: 1) Take a flat file. 2) With the help of proposed algorithm convert it into html form. # CONCLUSION In data warehouse data comes from different operational systems and flat file is one of the semi structured plain text file that comes form source and to store it in data warehouse flat file doesn't store directly in the data warehouse it need some transformation in . 20113![Global Journals Inc. (US) 43 Import html data into the database. 4) Manage (updation/deletion/insertion) the flat file data in the data warehouse. V. ALGORITHM Input : Flat file containing single line plain text data. Output : Data transform into uniform format. Begin 1. Enter Input flat file 2. Flat file transform to uniform format 3. Transform uniform data to html form 4. If flat file is transformed as a html then 5. Import html data into database. 6. Extract stored data from the database. 7. Manage (updation/deletion/insertion) of the flat file data in the data warehouse. End](image-2.png "A © 2011 ? 3 )") 12![Figure.1 : Transformation Model of Flat File](image-3.png "Figure. 1 :Figure. 2 :") © 2011 Global Journals Inc. (US) August ## August ETL (Extract, Transform and load) of the data warehouse which is the core component of the data warehouse. In this paper a flat file containing delimiters comma, colon and tube ( | ) characters and an algorithm is used for transformation of the flat file into uniform format means in html form and then transformed in to structured data. Flat file management (updation/deletion/insertion) is easy when transform it into structured format. Proposed algorithm is implemented in PHP and MySQL tools. * Multilingual Lossy Text Compression Using Wavelet Transform SAAl-Dubaee NAhmad First International Conference on Integrated Intelligent Computing. (ICIIC) 2010 * The Impact of Site Structure and User Environment on Session Reconstruction in Web Usage Analysis BettinaBerendt BamshadMobasher MikiNakagawa MyraSpiliopoulou Proceeding of the WEBKDD 2002 Workshop eeding of the WEBKDD 2002 WorkshopEdmonton, Canada * A Language for Extracting Signatures from Data Streams KCortes DFisher APregibon FRogers Smith Hancock Proc. ACM Int. Conf. on Knowledge Discovery and Data Mining ACM Int. Conf. on Knowledge Discovery and Data Mining 2000 * MOVIEMOD:An Implementable Decision-Support System for Prerelease Market Evaluation of Motion Pictures Eliashberg JedidJJehoshua MohanbirSJonker BernardSawhney Wierenga MarketingScience 19 3 2000 * The Firm's Management of Social Interactions DavidGodes DinaMayzlin YuboChen SanjivDas ChrysanthosDellarocas BrucePfeiffer BarakLibai SubrataSen MengzeShi PeeterVerlegh Marketing Letters 16 3 2005 * Word-of-Mouth for Movies: Its Dynamics and Impact on Box Office Revenue YongLiu Journal of Marketing 3 70 2006 * Promotional Chat on the Internet DinaMayzlin Marketing Science 25 2 2006 * Conditional models of identity uncertainty with application to noun co reference AndrewMccallum BenWellner Advances in Neural Information Processing Systems 17 2005