GENSCAN 1.0 Date run: 8-Nov-116 Time: 14:19:43 Sequence gi568815595r:180884643_181088230 : 203588 bp : 38.51% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3077 3463 387 1 0 42 44 312 0.301 19.06 1.02 Intr + 14713 14850 138 0 0 110 61 36 0.228 2.84 1.03 Term + 15003 15071 69 2 0 117 36 46 0.227 -0.74 1.04 PlyA + 15733 15738 6 1.05 2.00 Prom + 17089 17128 40 -4.85 2.01 Init + 28044 28094 51 2 0 109 79 83 0.398 10.71 2.02 Intr + 48692 48744 53 1 2 120 97 52 0.993 6.09 2.03 Intr + 50496 50589 94 0 1 82 87 102 0.998 8.55 2.04 Intr + 63223 63294 72 0 0 83 100 81 0.993 7.48 2.05 Intr + 63705 63853 149 2 2 53 69 150 0.998 7.71 2.06 Intr + 64079 64172 94 2 1 70 99 27 0.996 1.05 2.07 Intr + 64585 64701 117 0 0 48 100 112 0.996 8.14 2.08 Intr + 66656 66826 171 1 0 53 107 210 0.999 18.62 2.09 Intr + 69120 69198 79 2 1 100 74 82 0.988 6.21 2.10 Intr + 73177 73286 110 2 2 54 105 132 0.993 10.58 2.11 Intr + 76826 76912 87 1 0 90 76 34 0.753 1.65 2.12 Intr + 78241 78448 208 0 1 57 53 226 0.927 13.73 2.13 Intr + 83409 83612 204 1 0 67 26 221 0.607 12.05 2.14 Intr + 85444 85716 273 2 0 3 77 380 0.682 24.99 2.15 Intr + 86433 86513 81 1 0 41 93 76 0.753 2.09 2.16 Term + 91430 91650 221 0 2 100 37 261 0.974 18.42 2.17 PlyA + 92493 92498 6 1.05 3.00 Prom + 99666 99705 40 -6.55 3.01 Init + 103109 103123 15 1 0 51 80 39 0.276 -1.36 3.02 Intr + 104697 104814 118 2 1 40 105 101 0.789 6.12 3.03 Intr + 104856 105210 355 1 1 88 47 304 0.883 19.72 3.04 Intr + 117988 118040 53 1 2 56 61 36 0.001 -4.67 3.05 Term + 119133 119290 158 1 2 90 51 123 0.002 5.91 3.06 PlyA + 121139 121144 6 1.05 4.00 Prom + 122191 122230 40 -3.35 4.01 Init + 133486 133546 61 0 1 88 62 78 0.966 6.66 4.02 Term + 135560 135675 116 0 2 125 36 91 0.973 5.35 4.03 PlyA + 136685 136690 6 1.05 5.05 PlyA - 136785 136780 6 1.05 5.04 Term - 159590 159077 514 1 1 54 48 227 0.147 8.13 5.03 Intr - 160294 160155 140 1 2 43 92 73 0.162 1.54 5.02 Intr - 164785 164536 250 0 1 82 61 142 0.403 7.42 5.01 Init - 170174 170107 68 2 2 71 37 44 0.173 -1.80 5.00 Prom - 173443 173404 40 -6.05 6.04 PlyA - 173903 173898 6 1.05 6.03 Term - 176231 176116 116 0 2 130 42 77 0.962 5.05 6.02 Intr - 185000 184921 80 1 2 74 99 51 0.017 3.08 6.01 Intr - 202715 202551 165 1 0 82 116 87 0.496 9.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 100068 99998 71 1 2 117 45 91 0.885 4.82 S.002 Intr - 101354 101284 71 1 2 41 115 94 0.818 5.31 S.003 Init - 184996 184921 76 1 1 83 99 39 0.921 5.80 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:180884643_181088230|GENSCAN_predicted_peptide_1|197_aa MASWAALSTKALHFLEETVSDLSTVRFVAHQQHFQLLGVVDQELPETTGQHVLCFFVVPI TNVGHQDLALEPSMNPVVSTSGFLPVRLNFDISVRLVPDELLGSLFEDLGLHKGSEGSHD SEEEMTATSVIVFSYYTPLRCILQPERVNLPSFKLVVLGLDLGEGNPEAQHASKRAIKFQ TVMQLELLTMASSARNP >gi568815595r:180884643_181088230|GENSCAN_predicted_CDS_1|594_bp atggccagctgggcagctctttccaccaaggctttgcatttcttggaggaaactgtgagc gatctcagcacagtaagatttgttgcacatcagcagcacttccagctccttggtgttgtg gaccaggaacttccagaaaccactgggcagcatgtgctttgtttttttgttgttcccata accaatgttgggcatcaagatctggcccttgaaccttctatgaaccctgttgtcagcacc tctgggtttctgccagttaggcttaattttgacatatcggtcagactggtgccagatgaa cttcttggttctctttttgaggatcttgggcttcataaggggtctgagggcagccatgat tccgaggaggagatgactgccacctccgtaattgtgttttcgtactacactcccctgaga tgcatcctccaacctgaaagagttaatttgccaagctttaaactggttgtcttaggattg gacttaggggaagggaacccagaagcccaacatgccagcaaaagggccatcaaattccaa acagtcatgcaactggagcttctgacaatggcctcttctgccaggaacccttaa >gi568815595r:180884643_181088230|GENSCAN_predicted_peptide_2|687_aa MAELTVEVRGSNGAFYKGFIKDVHEDSLTVVFENNWQPERQVPFNEVRLPPPPDIKKEIS EGDEVEVYSRANDQEPCGWWLAKVRMMKGEFYVIEYAACDATYNEIVTFERLRPVNQNKT VKKNTFFKCTVDVPEDLREACANENAHKDFKKAVGACRIFYHPETTQLMILSASEATVKR VNILSDMHLRSIRTKLMLMSRNEEATKHLECTKQLAAAFHEEFVVREDLMGLAIGTHGSN IQQARKVPGVTAIELDEDTGTFRIYGESADAVKKARGFLEFVEDFIQVPRNLVGKVIGKN GKVIQEIVDKSGVVRVRIEGDNENKLPREDGMVPFVFVGTKESIGNVQVLLEYHIAYLKE VEQLRMERLQIDEQLRQIGMGFRPSSTRGPEKEKGYATDESTVSSVQGSRSYSGRGRGRR GPNYTSGYGTNSELSNPSETESERKDELSDWSLAGEDDRDSRHQRDSRRRPGGRGRSVSG GRGRGGPRGGKSSISSVQYRSNIHNCSTLKRIFLASDMNIVLKDPDSNPYSLLDNTESDQ TADTDASESHHSTNRRRRSRRRRTDEDAVLMDGMTESDTASVNENGLDDSEKKPQRRNRS RRRRFRGQAEDRQPAIDFIYKEVEKVVSLWQAKDVIEEHGPSEKAINGPTSASGDDISKL QRTPGEEKINTLKEENTQEAAVLNGVS >gi568815595r:180884643_181088230|GENSCAN_predicted_CDS_2|2064_bp atggcggagctgacggtggaggttcgcggctctaacggggctttctacaagggatttatc aaagatgttcatgaagactcccttacagttgtttttgaaaataattggcaaccagaacgc caggttccatttaatgaagttagattaccaccaccacctgatataaaaaaagaaattagt gaaggagatgaagtagaggtatattcaagagcaaatgaccaagagccatgtgggtggtgg ttggctaaagttcggatgatgaaaggagaattttatgtcattgaatatgctgcttgtgac gctacttacaatgaaatagtcacatttgaacgacttcggcctgtcaatcaaaataaaact gtcaaaaaaaataccttctttaaatgcacagtggatgttcctgaggatttgagagaggcg tgtgctaatgaaaatgcacataaagattttaagaaagcagtaggagcatgcagaattttt taccatccagaaacaacacagctaatgatactgtctgccagtgaagcaactgtgaagaga gtaaacatcttaagtgacatgcatttgcgaagtattcgtacgaagttgatgcttatgtcc agaaatgaagaggccactaagcatttagaatgcacaaaacaacttgcagcagcttttcat gaggaatttgttgtgagagaagatttaatgggcctggcaataggaacacatggtagtaac atccagcaagctaggaaggttcctggagttaccgccattgagctagatgaagatactgga acattcagaatctacggagagagtgctgatgctgtaaaaaaggctagaggtttcttggaa tttgtggaggattttattcaggttcctaggaatctcgttggaaaagtaattggaaaaaat ggcaaagttattcaagaaatagtggacaaatctggtgtggttcgagtgagaattgaaggg gacaatgaaaataaattacccagagaagacggtatggttccatttgtatttgttggcact aaagaaagcattggaaatgtgcaggttcttctagagtatcatattgcctatctaaaggaa gtagaacagctaagaatggaacgcctacagattgatgaacagctgcgacagattggtatg ggtttcagaccttcttccaccagagggcctgaaaaagagaaaggatatgccactgatgaa agtaccgtctcttctgtacaaggttctaggtcttatagcggaagaggcagaggtcgtcgg ggacctaattacacctccggttatggtacaaattctgagctgtctaacccctctgaaacg gaatctgagcgtaaagacgagctgagtgattggtcattggcaggagaagatgatcgagac agccgacatcagcgtgacagcaggagacgcccaggaggaagaggcagaagtgtttcaggg ggtcgaggtcgtggtggaccacgtggtggcaaatcctccatcagttctgttcaatataga tcaaacattcataattgcagtactcttaaacgaatatttcttgcctctgacatgaatata gtgctcaaagatccagacagcaatccatacagcttacttgataatacagaatcagatcag actgcagacactgatgccagcgaatctcatcacagtactaaccgtcgtaggcggtctcgt agacgaaggactgatgaagatgctgttctgatggatggaatgactgaatctgatacagct tcagttaatgaaaatgggctagatgatagtgaaaaaaaaccccagcgacgcaatcgtagc cgcaggcgtcgcttcaggggtcaggcagaagatagacagccagctatagatttcatttat aaagaagttgaaaaagttgtctccctttggcaggcaaaagatgtgattgaagagcatggt ccttcagaaaaggcaataaacggcccaactagtgcttctggcgatgacatttctaagcta cagcgtactccaggagaagaaaagattaataccttaaaagaagaaaacactcaagaagca gcagtcctgaatggtgtttcataa >gi568815595r:180884643_181088230|GENSCAN_predicted_peptide_3|232_aa MLKVMEFQASNTCRARADKKLENRAERRCKNTNPQESEPTGLCPVALRSTTSTPHLRGGE LRLRPGPEVAKKTGRPHSPWLRLGSLASTGSTAHPSSEAAANTCTPLPESDATPNLKHRR PTQHGRSSRKLGRKTVVKGDAEEKESRSSSGQSGESDFGLVEGGWYRFFHSTFNAPFRSS CLLEFAGGALQTLFACLSPTEAAEQQRLLSVTSSGSFIPEGHLPDASRSSPL >gi568815595r:180884643_181088230|GENSCAN_predicted_CDS_3|699_bp atgctgaaggtgatggagttccaggcaagcaacacctgcagagcccgtgcggacaagaag ctggagaaccgtgcagaaagacgatgtaaaaacacaaacccccaagagagcgagccaaca gggttgtgtcctgtcgcactaaggagcacaacttcaactccacacctccgaggcggggag ctgaggttgaggcctgggccggaggtcgccaagaagaccggaaggccgcactcaccatgg ctccggctgggctcccttgcttccaccgggagcacggctcatcccagctcagaggccgcg gccaacacctgcacgcctttaccagagagcgacgcaacccccaacctcaagcacaggcgc cctacgcaacacggcaggagcagccgcaaactaggccggaaaactgtcgtaaaaggggac gcggaggagaaggaaagtcgctcaagcagtggacagagcggagaaagcgacttcgggttg gttgaaggtggctggtaccgatttttccattccacgtttaatgctcccttcaggagctct tgtctgctggagtttgctggaggtgcactccagactctgtttgcctgcctatcaccaaca gaggctgcagaacagcaaagattgctgtctgtaacttcctctggaagcttcatcccagag gggcacctgccagatgccagccggagctctcctctatga >gi568815595r:180884643_181088230|GENSCAN_predicted_peptide_4|58_aa MGLHGDSTRVSGEAFSSLSPGLNARPSDCGNYLSLMFQKEKNSFLLDFLRYFSCDKQI >gi568815595r:180884643_181088230|GENSCAN_predicted_CDS_4|177_bp atgggcttacatggagattccactcgtgtgagtggagaggctttttcttcactgagccct ggtctaaatgctagacctagtgactgtggcaactatctctcactgatgttccagaaagag aagaactccttcctgctagactttcttcgatatttcagttgtgataagcagatttaa >gi568815595r:180884643_181088230|GENSCAN_predicted_peptide_5|323_aa MCDLAYTSIGQDTEIWNKFQGSSLRKFPVIHAPIHTRQRPEESPVQMSGALLVAAPLSVN SAPQILAAKVFLIYHFNARRPQGSASVFAPCAVETTSRKTIGVISQACLTRAPEGCTKHE KEQPVPATAKTCQIVKTINFGKKLHQLMSKITRTLHPKSTEYTFFSAPHRTYSKIDYIVG SKELLSKCKRTEIITNCLSNHSAIKLQLRIKKLTQNRSTTWKLNNLLLNDYLVHNEMKAE IKMFFETNENKDITYQNLWDTFKAVCRGKFIALNAHRRKQERSKIDTLTSQLKELEKQEQ THSKASRRQEIAKIKAELKEIET >gi568815595r:180884643_181088230|GENSCAN_predicted_CDS_5|972_bp atgtgtgacctggcctacacaagcattggccaagatactgaaatctggaataaattccaa gggtcaagcctcaggaagtttcctgtcatacatgcacctattcatactcgtcaaagacct gaggagagccctgtgcaaatgtctggagctctcttagttgcagctcccctttctgtaaac tctgccccgcaaattctagctgccaaggttttcctgatctatcacttcaatgcaagaaga ccacaaggctcagcttcagtttttgctccctgcgctgtggaaactacctccaggaagaca attggtgtcatcagccaggcttgccttacaagagctcccgaaggatgcactaaacatgaa aaggaacaaccggtaccagccactgcaaaaacatgccaaattgtaaagaccatcaatttt gggaagaaactgcatcaactaatgagcaaaataaccagaactctccaccccaaatcaaca gaatatacattcttctcagcaccacatcgcacttattccaaaattgactacatagttgga agtaaagaactcctcagcaaatgtaaaagaacagaaattataacaaactgtctttcaaac cacagtgcaatcaaattacaactgaggattaagaaactcactcaaaaccgctcaactaca tggaaactgaacaacctgctcctgaatgactacctggtacataatgaaatgaaggcagaa ataaagatgttctttgaaaccaatgagaacaaagacataacataccagaatctctgggac acatttaaagcagtgtgtagagggaaatttatagcattaaatgcccacaggagaaagcag gaaagatctaaaattgacaccctaacatcacaattaaaagaactagagaagcaagagcaa acacattcaaaagctagcagaaggcaagaaatagctaagatcaaagcagaactgaaggag atagagacatga >gi568815595r:180884643_181088230|GENSCAN_predicted_peptide_6|120_aa XLTTPPSLYQILPWTALKLDQEAGARLPLVGKAPQIAFGSTVVSGKGTIKERKLERDMDE ARSHHPQQINTGTENQTPHVLTREDELTVVLRKPEPRSKPLAITFPKRVKTRNENVNQTC >gi568815595r:180884643_181088230|GENSCAN_predicted_CDS_6|363_bp nngctgactacccctccatccctttatcaaatcttgccctggactgcattaaagttagat caggaagcaggagctcgtctccccttggttgggaaagctccacaaatagcttttgggtct acagtggtgagtgggaagggcactatcaaggaaaggaaattggaaagggacatggatgaa gctagaagccatcatcctcagcaaataaacacaggaacagaaaaccaaacaccgcatgtt ctcactcgggaggatgaactcacagttgttttgcgaaagccagaaccccgctccaagcct ctagcaataactttcccaaaaagagtcaagacacgaaatgaaaatgtcaatcaaacatgt tga