GENSCAN 1.0 Date run: 6-Nov-116 Time: 16:25:40 Sequence gi568815591f:30652151_30855848 : 203698 bp : 48.96% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.27 PlyA - 452 447 6 1.05 1.26 Term - 1450 1310 141 1 0 116 46 149 0.991 11.43 1.25 Intr - 2930 2889 42 2 0 143 64 79 0.998 9.44 1.24 Intr - 3565 3430 136 0 1 111 75 222 0.996 23.77 1.23 Intr - 3738 3620 119 0 2 2 68 57 0.529 -5.64 1.22 Intr - 3862 3777 86 2 2 116 91 89 0.527 11.54 1.21 Intr - 4433 4397 37 2 1 98 27 42 0.390 -3.06 1.20 Intr - 8495 8423 73 1 1 86 107 160 0.998 17.11 1.19 Intr - 10066 10006 61 2 1 96 98 133 0.953 12.89 1.18 Intr - 10697 10544 154 2 1 101 31 380 0.934 33.25 1.17 Intr - 13037 12920 118 1 1 119 26 205 0.786 17.87 1.16 Intr - 13489 13380 110 1 2 64 101 277 0.692 25.68 1.15 Intr - 15163 15078 86 2 2 73 113 104 0.953 10.94 1.14 Intr - 22370 22332 39 0 0 117 94 -9 0.007 0.80 1.13 Intr - 29890 29765 126 2 0 107 86 236 0.033 25.95 1.12 Intr - 30182 30028 155 1 2 100 61 183 0.567 16.52 1.11 Intr - 36492 36297 196 1 1 91 76 7 0.144 -1.73 1.10 Intr - 37134 37041 94 0 1 134 96 111 0.975 16.04 1.09 Intr - 43380 43351 30 0 0 95 94 7 0.520 0.33 1.08 Intr - 46320 46195 126 0 0 107 34 44 0.615 1.78 1.07 Intr - 47260 47109 152 2 2 122 81 -3 0.593 2.28 1.06 Intr - 47708 47562 147 0 0 2 68 135 0.452 3.11 1.05 Intr - 70888 70848 41 0 2 108 105 34 0.216 4.97 1.04 Intr - 79403 79223 181 0 1 66 75 52 0.277 0.63 1.03 Intr - 89703 89572 132 1 0 55 25 92 0.421 0.22 1.02 Intr - 90570 90188 383 2 2 45 71 281 0.646 16.65 1.01 Init - 92102 92017 86 2 2 89 36 65 0.475 1.76 1.00 Prom - 97983 97944 40 -6.06 2.00 Prom + 98114 98153 40 -3.86 2.01 Init + 100001 100154 154 1 1 100 93 185 0.856 20.34 2.02 Intr + 101581 101788 208 2 1 107 91 290 0.999 29.34 2.03 Term + 103272 103701 430 0 1 118 52 595 0.998 53.67 2.04 PlyA + 104819 104824 6 1.05 3.00 Prom + 110683 110722 40 -2.76 3.01 Init + 118844 118883 40 1 1 60 86 40 0.938 1.07 3.02 Intr + 119198 119406 209 0 2 88 92 144 0.916 13.60 3.03 Intr + 126282 126401 120 2 0 69 95 106 0.873 10.09 3.04 Intr + 129827 130062 236 1 2 73 70 9 0.174 -5.91 3.05 Intr + 133599 133842 244 0 1 78 75 118 0.337 7.00 3.06 Intr + 138650 138760 111 1 0 58 56 120 0.976 6.38 3.07 Intr + 139015 139424 410 0 2 105 109 170 0.531 13.87 3.08 Intr + 148058 148124 67 2 1 87 85 64 0.229 4.91 3.09 Intr + 151553 151692 140 1 2 58 64 13 0.001 -4.84 3.10 Intr + 176529 176587 59 0 2 95 86 62 0.153 5.23 3.11 Intr + 184508 184614 107 0 2 61 115 81 0.944 8.03 3.12 Intr + 187319 187413 95 1 2 59 68 86 0.780 2.46 3.13 Intr + 187944 188084 141 0 0 55 76 52 0.373 0.17 3.14 Intr + 188610 188698 89 0 2 94 89 64 0.936 6.71 3.15 Term + 189420 189487 68 1 2 131 42 53 0.962 3.10 3.16 PlyA + 189489 189494 6 -0.45 4.06 PlyA - 189729 189724 6 1.05 4.05 Term - 191672 191598 75 2 0 89 43 71 0.199 0.54 4.04 Intr - 194607 194401 207 0 0 50 61 95 0.325 2.27 4.03 Intr - 196574 196492 83 0 2 84 47 76 0.438 2.46 4.02 Intr - 201412 201249 164 0 2 91 68 26 0.512 0.52 4.01 Intr - 202667 202420 248 2 2 147 100 66 0.755 9.96 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 29890 29706 185 2 2 107 50 306 0.967 26.41 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:30652151_30855848|GENSCAN_predicted_peptide_1|1016_aa MSSKGKGPVVAGSRPCVEDSEKGAVRGVECPLKSACPDVDMYWAALQRLASLLKADGNLV TMAALHSHYMVGSKKFFGLHPEKETVEKAMQEAGCQVQRCAPSAALRHAPSTTASALWLP ARAPVSEASLAEEGLTERGSHDQSIYSLAPQLPLKAVQPKREASSEDSQVLAVGSSASML REWEEAARASATDMKCPVGWDRKRHNCLLFLQYCKGSHLFYRGSKGRKLLKTQSERPAIR LSDSQISPSLLPCRGGMTVERRLKWADELIHYLRGPGGWAGSGILVSDDELEIPCLGEGV SGPGGWAGSGILVSDDELEIPCLVPGIRQPPVPDERWGQSGLWENQPLQGFLKTQSFIQP HPVPLSRVRGHRWKIGAFQEIQVLGQGLPLSPSLWGPCCQPPKLVRRRKLLLAAPESVFR AQGSAQAAGQSQMPKDQPLWALLEQYCHTIMTLTNLSGLHPQDGSGGDPCSGLGPLPGHK ETRVLASTQTQPTGTAQARRQRESLAACSPSTGAVNSSGVGISHAGRALPLPLRTRLRDA MDAALLHSLLEANCSLALAEELLLDGWGPPLDPEGPYSYCNTTLDQIGTCWPRSAAGALV ERPCPEYFNGVKYNTTRGLPSSDLSSSAPGNAYRECLENGTWASKINYSQCEPILDDKQR KYDLHYRIALVVNYLGHCVSVAALVAAFLLFLALRSIRCLRNVIHWNLITTFILRNVMWF LLQLVDHEVHESNEVWCRCITTIFNYFVVTNFFWMFVEGCYLHTAIVMTYSTERLRKCLF LFIGWCIPFPIIVAWAIGKLYYENEQCWFGKEPGDLVDYIYQGPIILVLLGAYEGSGVPG HRDQFRISVQHRQDPNDKVTRVHHIRDNPVQEEVIGPSPCFLAEPSAPGLTQGARSSSRK KHHHCLPPPTGKAVKATLVLLPLLGITYMLFFVNPGEDDLSQIMFIYFNSFLQSFQGFFV SVFYCFFNGEVRSAVRKRWHRWQDHHSLRVPMARAMSIPTSPTRISFHSIKQTAAV >gi568815591f:30652151_30855848|GENSCAN_predicted_CDS_1|3051_bp atgagcagcaagggcaaaggcccagtggtggcggggagccgcccgtgtgtcgaggacagt gaaaagggtgctgtgcgaggggtggagtgtcccctgaagagtgcctgtcctgatgtggac atgtactgggcagccctgcagaggctggccagcctgctgaaagcagatgggaacctggtc accatggctgccctgcactcccactatatggtaggctccaagaagttctttgggctccac ccagagaaggagacggtggagaaggccatgcaggaggctggctgccaggtgcagaggtgt gccccatcagctgctctgagacatgctccatcaacgacggcatctgctttgtggctgcct gcaagggccccagtgtctgaggccagcctggcagaagagggcctgaccgagaggggaagt catgaccagagcatctacagtcttgccccacagctccccctaaaagcagtgcagcccaag agagaagcctcgagtgaagactcacaggtgctggctgtgggaagttcagccagcatgctg agggaatgggaagaggcagcaagagcatctgcaacagacatgaaatgtcctgttggctgg gaccgcaaaaggcataactgcttgcttttccttcagtactgcaaaggaagtcatcttttc taccgcggcagcaaaggcagaaagcttctgaaaacccagtcagagaggccagccattcgg ctttcagacagccagataagcccctcgctcctgccttgccgtgggggaatgacagtagaa aggaggctgaagtgggcagatgagctcattcactacctcagagggcctggtggctgggct ggatcggggatccttgtgtctgatgatgaattggagatcccgtgcctgggtgagggggtc tcagggcctggtggctgggctggatcggggatccttgtgtctgatgatgaattggagatc ccgtgcctggtgcctggaatcaggcagcccccagtaccagatgagcgatggggacagagt gggctctgggagaaccagcctctccaagggtttctcaagactcagagcttcatccagcca caccccgtgcctttgtccagagtcaggggccaccgatggaagataggtgccttccaggag atccaagtgttggggcaagggttgcccctcagcccgtccctctggggtccatgctgccag ccgcccaagctggttagacggaggaagcttctgctggctgctccagagtcagtgttcagg gcccagggctcggcacaggcagccgggcagagccagatgcccaaagaccagcccctgtgg gcacttctggagcagtactgccacaccatcatgaccctcaccaacctctcaggtctgcat ccacaggatgggagtgggggtgacccttgctcgggcctggggccactgccagggcacaag gagacaagggtgctggccagcacccagacccagcctacaggcactgcccaagccaggcgt caaagagaaagtcttgctgcctgttctccttccactggggcagtcaatagctccggtgtt gggatcagtcacgccgggcgcgcactcccactccctctccgcacgcggctgcgggacgcg atggacgcggcactgctccacagcctgctggaggccaactgcagcctggcgctggctgaa gagctgctcttggacggctgggggccacccctggaccccgagggtccctactcctactgc aacacgaccttggaccagatcggaacgtgctggccccgcagcgctgccggagccctcgtg gagaggccgtgccccgagtacttcaacggcgtcaagtacaacacgacccgtggtctcccc tcttcagacctcagttcatctgcaccagggaatgcctatcgagaatgcttggagaatggg acgtgggcctcaaagatcaactactcacagtgtgagcccattttggatgacaagcagagg aagtatgacctgcactaccgcatcgcccttgtcgtcaactacctgggccactgcgtatct gtggcagccctggtggccgccttcctgcttttcctggccctgcggagcattcgctgtctg cggaatgtgattcactggaacctcatcaccacctttatcctgcgaaatgtcatgtggttc ctgctgcagctcgttgaccatgaagtgcacgagagcaatgaggtctggtgccgctgcatc accaccatcttcaactacttcgtggtgaccaacttcttctggatgtttgtggaaggctgc tacctgcacacggccattgtcatgacctactccactgagcgcctgcgcaagtgcctcttc ctcttcatcggatggtgcatccccttccccatcatcgtcgcctgggccatcggcaagctc tactatgagaatgaacagtgctggtttggcaaggagcctggcgacctggtggactacatc taccaaggccccatcattctcgtgctcctgggtgcctacgaaggctctggagttcctggg cacagggatcaatttcgtatttctgttcaacatcgtcaggatcctaatgacaaagttacg cgcgtccaccacatccgagacaatccagtacaggaggaggtcatcgggccgtccccctgc ttcctggcagagccctctgccccaggcctcacccagggagctcgctcaagctcccggaag aaacaccaccactgtctcccgccgcccactgggaaggcagtgaaggccaccctggtgctc ctgcccctcctgggcatcacctacatgctcttcttcgtcaatcccggggaggacgacctg tcacagatcatgttcatctatttcaactccttcctgcagtcgttccagggtttcttcgtg tctgtcttctactgcttcttcaatggagaggtgcgctcagccgtgaggaagaggtggcac cgctggcaggaccatcactcccttcgagtccccatggcccgggccatgtccatccctaca tcacccacacggatcagcttccacagcatcaagcagacggccgctgtgtga >gi568815591f:30652151_30855848|GENSCAN_predicted_peptide_2|263_aa MKGGFTGGDEYQKHFLPRDYLATYYSFDGSPSPEAEMLKFNLECLHKTFGPGGLQGDTLI DIGSGPTIYQVLAACDSFQDITLSDFTDRNREELEKWLKKEPGAYDWTPAVKFACELEGN SGRWEEKEEKLRAAVKRVLKCDVHLGNPLAPAVLPLADCVLTLLAMECACCSLDAYRAAL CNLASLLKPGGHLVTTVTLRLPSYMVGKREFSCVALEKEEVEQAVLDAGFDIEQLLHSPQ SYSVTNAANNGVCFIVARKKPGP >gi568815591f:30652151_30855848|GENSCAN_predicted_CDS_2|792_bp atgaagggtggcttcactgggggtgatgagtaccagaagcacttcctgcccagggactac ttggctacttactacagcttcgatggcagcccctcacccgaggccgagatgctgaagttt aacttggaatgtctccacaagaccttcggccctggaggcctccaaggggacacgctgatt gacattggctcaggtcctaccatctaccaagttcttgctgcctgtgattccttccaagac atcactctctccgactttaccgaccgcaaccgggaggagctggaaaagtggctgaagaag gagccgggggcctatgactggaccccagcggtgaaattcgcctgtgagctggaaggaaac agcggccgatgggaggagaaggaggagaagctgcgggcagcggtgaagcgggtgctcaag tgcgatgtccacctgggcaacccgctggccccggctgtgttgcctctcgccgactgtgtg ctcaccctgctggccatggagtgtgcctgctgtagccttgatgcctaccgcgctgccctg tgcaaccttgcctcactgctcaagccgggtggccacctggtgaccactgtcacgcttcgg ctcccgtcctacatggtggggaagcgtgaattttcctgcgtggccctggagaaagaggag gtggagcaggctgtcctggatgctggctttgacattgaacagctcctacacagtccccag agctactctgtcaccaatgctgccaacaatggggtctgcttcattgtggctcgcaagaag cctgggccctga >gi568815591f:30652151_30855848|GENSCAN_predicted_peptide_3|711_aa MLGFLAPTVKARQARQPQGRGGVLAAPTPPSPRQRGHTAPDRPSCLVLRPGVGLVGRARA RAMDSLFVEEVAASLVREFLSRKGLKKTCVTMDQERPRSDLSINNRNDLRKVLHLEFLYK ENKAKENPLKTSLELITRYFLDHFGNTANNFTQDTPIPALSVPKKNNKVPSRCSETTLVN IYDLSDEDAGWRTSLSETSKARHDNLDGDVLGNFVSSKRPPHKSKPMQTVPGETPVLTSA WEKIDKLHSEPSLDVKRMGENSRPKSGLIVRGMMSGPIASSPQVRKSGTEKLSSLPQVTQ LVGGDADWSDGFTCCMNLLSDSFHRHYLRRSSPSSSSTQPQEESRKVPELFVCTQQDILA SSNSSPSRTSLGQLSELTVERQKTTASSPPHLPSKRLPPWDRARPRDPSEDTPAVDGSTD TDRMPLKLYLPGGNSRMTQERLERAFKRQGSQPAPVRYYSKDKLQSLSLDITLDALDALG GKFAGIPGVSFLKTEAALLFFPSVFFYGLHWVLLHLQTFLDNVHLWKNQLLPSDKVDGEL GALRLEDVEDELIREEVILSPVPSVLKLQTASKPIDLSVAKALQMWLVQQELGNKTLCFG DVKIEHGFKEHLLQSLNLDEVCLCCSVPLAFANVSQESDPVLSPVMPTEMSSSRSLCTTG VVLAESWQLSKAVSYRNSCLKEIAKPTVLSPVTSLYRVTLRATELLAIGLF >gi568815591f:30652151_30855848|GENSCAN_predicted_CDS_3|2136_bp atgctgggcttccttgcccccaccgtcaaggcccgccaagcccgccagccccagggaagg ggaggagttcttgccgcgccgacgccgccgtcgccacggcaacgcggccatactgcgccg gacagacccagttgcctggtgctgcggcccggcgtgggcctcgtgggcagagccagagcc agagccatggacagcctcttcgtggaggaggtggccgcctccttggtcagggagttcctc agcagaaagggcttaaagaagacatgtgtgaccatggaccaggaacgcccacgctctgac ctcagcataaacaacagaaatgatcttcgaaaggttttgcatcttgaatttctctataag gagaacaaggcaaaggaaaatcctctaaaaacaagccttgaactcatcaccagatacttt ctggatcactttggaaatacggctaacaatttcactcaagataccccaatccctgcactc tcagttccaaagaaaaataacaaagtgccatcaagatgctcagagactacactggtaaat atatatgacctttcagatgaagatgcaggatggagaacatcattgtcagaaacaagcaaa gccagacatgacaatcttgatggagatgtacttggtaattttgtatcatctaaaaggccc ccgcacaaaagtaagcccatgcagacggtcccgggtgaaactcctgtgttgacttctgca tgggagaagatagacaagcttcactcggagccttccttggatgtgaagaggatgggagag aattccaggccaaagtctggtctgattgtgcgaggcatgatgtctgggcccatcgccagc tccccacaggtgaggaaatcagggacagagaagttaagcagcttgccacaggtgacacag ttggtaggtggtgatgcagactggagcgacggcttcacctgctgcatgaacctgctctct gattcttttcacagacactatctgagacggtcctcaccgtcaagcagctccacccaaccc caagaagagagccggaaggtccctgagctctttgtctgcacccaacaggacattctggct tcgagcaacagctccccctccaggacctccctgggtcagcttagtgaactgaccgtagaa aggcagaaaaccactgccagcagccctccccatctgcccagcaaacggctgcccccatgg gacagggccaggccgagggatccctccgaggacaccccagcagtggacggcagcacagac acggacaggatgcccttgaagctctacttgcctggtggtaattccaggatgacccaggag aggctggaaagagcgttcaaacggcagggcagccagcccgcacctgtcaggtattacagt aaagataaactgcaaagcctgtccctggatatcaccttagatgcattggatgccctgggg ggaaagtttgctggtatccctggggtctcattcctaaagactgaagcagctctcttgttc ttccccagcgtgtttttctacggcctgcattgggttttgctgcatttgcagacatttttg gataatgtacacttatggaaaaatcagttgctgccgtctgacaaggtggatggtgagctg ggtgccctgcggctcgaggatgtggaggatgagttgataagggaagaggtcatcctgtcg ccagtcccatcagtgctcaagttgcagacagcatcaaaaccaattgacctctcagtagca aaggcacttcagatgtggctggtgcagcaagagctggggaataagaccctctgctttgga gatgttaaaattgaacatggatttaaggagcacttgctacaatccttaaacctggatgag gtctgtttgtgttgttcagtcccactagcctttgccaatgtgtcccaggagtcagatcct gtgctaagccccgtgatgcctacagagatgagctcctcaagaagcctctgcaccacaggg gtggtccttgcggagtcctggcagctgtccaaggctgtgtcctacagaaactcctgtttg aaggagatagcaaagccgactgtgctcagcccggttaccagcttgtaccgagtcaccctc agagctacagaacttctggcaattggccttttttaa >gi568815591f:30652151_30855848|GENSCAN_predicted_peptide_4|258_aa GRPHCLPKGHLVLPGPDHFPGLCPGLQGLWESSAEGSLTAGALVLVPFPASSPTLRLTTK PSSAFCGAKKNGSHIQKVKGKARVGRQENNSSSQHCSHSEIRRLSDAHPTGVRHASYLMN ALLKKSHQVLIAGQTVKRSPSCSLEEIVTKHSPNRMPQLLGALAQSMEKLSSTKPVPGAK KAGDRGFNPPQISQGCMCLRENLHPQMMTGKGPASESPRPLPDSSAHAFTGRTQPQDPLG CTILGLQLGGTCLAGNDR >gi568815591f:30652151_30855848|GENSCAN_predicted_CDS_4|777_bp ggtaggccccattgcctacccaagggacatctggtgctgccagggcctgatcatttccca gggctgtgcccaggccttcagggtctttgggaaagttctgcggaaggcagtctgacagca ggagctttagtcctggtcccatttccagcaagttctcccactctccggcttaccaccaaa ccctccagtgccttctgtggggccaaaaagaatggatcccacatccagaaggtcaaaggg aaggcaagggtgggacgacaggagaacaactccagttcacagcattgttcccacagtgaa atcagaaggcttagtgacgcacatcccacaggagtaaggcatgcttcatacctgatgaat gctttgttgaagaaaagtcaccaggtcctcatagcaggtcaaactgtgaagcgaagccca tcatgttccttggaggaaatagttaccaagcattcaccgaaccggatgccccagctgctg ggagcactggcccagtccatggaaaaattgtcttccacaaaaccagtccctggtgccaaa aaggctggggaccgtggatttaacccaccccagattagccagggctgtatgtgcctgcgt gagaacctgcacccacagatgatgacagggaaaggtcctgcgtctgaatcacccaggcct ttaccagatagctcagcgcacgccttcactggcagaacccagccccaggacccactgggc tgcaccatccttggcctgcagctaggaggtacctgcctggccggcaacgacagatag