GENSCAN 1.0 Date run: 4-Nov-116 Time: 18:52:12 Sequence gi568815580f:41855292_42167513 : 312222 bp : 37.38% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 3203 3198 6 1.05 1.03 Term - 10934 10381 554 0 2 58 49 389 0.751 25.49 1.02 Intr - 11660 11496 165 0 0 32 44 122 0.542 1.21 1.01 Init - 11789 11687 103 2 1 60 82 112 0.818 6.30 1.00 Prom - 26764 26725 40 -2.65 2.04 PlyA - 26854 26849 6 1.05 2.03 Term - 32554 32534 21 1 0 116 48 24 0.049 -1.47 2.02 Intr - 41150 41060 91 1 1 68 92 74 0.162 4.88 2.01 Init - 58540 58443 98 1 2 58 31 114 0.011 2.53 2.00 Prom - 62360 62321 40 -3.85 3.00 Prom + 70105 70144 40 -3.05 3.01 Init + 75407 75518 112 1 1 72 92 14 0.208 0.71 3.02 Intr + 99966 100068 103 1 1 59 58 142 0.583 6.61 3.03 Intr + 102279 102467 189 0 0 78 100 101 0.996 8.08 3.04 Intr + 107198 107341 144 2 0 70 102 96 0.953 7.68 3.05 Intr + 115036 115165 130 1 1 104 93 91 0.984 10.98 3.06 Intr + 132482 132607 126 1 0 94 90 32 0.518 3.96 3.07 Intr + 135168 135263 96 2 0 81 63 96 0.972 5.69 3.08 Intr + 137979 138050 72 2 0 65 115 69 0.984 5.98 3.09 Intr + 140599 140703 105 0 0 18 64 111 0.694 1.19 3.10 Intr + 141347 141439 93 1 0 98 87 65 0.989 6.64 3.11 Intr + 149065 149250 186 0 0 64 42 184 0.986 10.36 3.12 Intr + 158151 158305 155 2 2 125 52 131 0.997 11.15 3.13 Intr + 172152 172257 106 0 1 46 86 107 0.880 5.60 3.14 Intr + 174034 174150 117 0 0 70 121 135 0.996 14.74 3.15 Intr + 178535 178666 132 1 0 97 25 96 0.877 4.12 3.16 Intr + 182401 182529 129 0 0 69 61 99 0.890 5.27 3.17 Intr + 183490 183559 70 0 1 40 115 86 0.918 4.34 3.18 Intr + 202592 202760 169 0 1 36 115 132 0.749 8.78 3.19 Intr + 209449 209539 91 1 1 119 86 65 0.998 8.48 3.20 Intr + 212097 212222 126 2 0 18 111 186 0.865 13.86 3.21 Intr + 243342 243461 120 2 0 45 87 116 0.389 6.97 3.22 Intr + 252573 252621 49 2 1 87 80 48 0.281 1.23 3.23 Term + 258217 258338 122 2 2 120 44 72 0.636 3.66 3.24 PlyA + 258848 258853 6 1.05 4.00 Prom + 260314 260353 40 -5.45 4.01 Sngl + 268441 268791 351 0 0 71 45 167 0.888 6.50 4.02 PlyA + 269126 269131 6 1.05 5.03 PlyA - 270466 270461 6 1.05 5.02 Term - 285386 285255 132 2 0 57 42 78 0.223 -2.79 5.01 Init - 286235 286152 84 2 0 85 68 64 0.242 4.97 5.00 Prom - 292958 292919 40 -1.35 6.06 PlyA - 293031 293026 6 1.05 6.05 Term - 295083 294676 408 0 0 4 37 227 0.331 3.43 6.04 Intr - 297938 297845 94 1 1 48 32 55 0.201 -5.05 6.03 Intr - 299972 299864 109 0 1 82 93 95 0.221 7.82 6.02 Intr - 309405 309316 90 1 0 119 41 62 0.153 3.65 6.01 Init - 309973 309943 31 1 1 80 98 49 0.954 5.06 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815580f:41855292_42167513|GENSCAN_predicted_peptide_1|273_aa MKPRTLAVSVTALKVARLESVPFDVQMCSEFLPSGVRLQTFAVSVTALKAARLELFVPPG GLMVSLASGEKLQIFSVSVIAHKSSVDPKNSGAQLASLSGSRTGAAGGAACQSRAVHSYS SALGWSMGLGAMEQGVVLVGEAPAAQEPMEWVEGSGMAGCRSRALPCGKAAKARREIEHS AGEPALLENPVHPLQPLARVPSSSLPGAGRAGRLLRVWGPPSPCPPGTPAGLQALHAAPV PARASPSTPPCKLREPAVALASPERGSHSAAVG >gi568815580f:41855292_42167513|GENSCAN_predicted_CDS_1|822_bp atgaagccgcggaccctcgcggtgagtgttacagcccttaaggtggcgcgtctggagtct gtcccttttgatgttcagatgtgttcggagtttcttccttctggagtgaggctgcagacc ttcgctgtgagtgttacagctcttaaggcagcacgtctggagttgttcgttcctcctggt gggctcatggtctcgctggcttcaggagagaagctgcagatcttctcggtgagtgttata gctcataaaagcagtgtggacccaaagaactcaggagcccagctggcttcacttagtgga tcccgcaccggggctgcaggtggagctgcctgccagtcccgcgccgtgcactcatactcc tcagcccttgggtggtcgatgggactgggcgccatggagcagggggtggtgctagtcggg gaggctccggccgcacaggagcccatggagtgggtggaaggctcaggcatggcgggctgc aggtcccgagccctgccctgcgggaaggcagctaaggcccggcgagaaatcgagcacagc gccggtgagccggcactgctggagaacccagtacatcctctgcagccgctggcccgggtg ccaagttcctcactgcccggggccggcagggccggccggctgctccgagtgtggggcccg ccaagcccatgcccacctggaactccagctggcctgcaagcgctgcacgcagcaccggtt cccgctcgcgcctctccctccacgcctccctgcaagctgagggagccggctgtggccttg gccagcccagaaaggggctcccatagtgcagcggtgggctga >gi568815580f:41855292_42167513|GENSCAN_predicted_peptide_2|69_aa MQQQAAILGADSSPRPDTNPASTLVLDSQAPELLLVLQHALLAVRGTCEIISEERRKDMK MRETHKQLS >gi568815580f:41855292_42167513|GENSCAN_predicted_CDS_2|210_bp atgcagcaacaagctgccatcctgggagcagacagcagccctcggccagacactaaccct gcaagcaccttggtattagactcccaggctccagaattactcttggtcctgcagcatgct ctactcgctgtgcgtgggacctgtgagataatcagtgaagagagacggaaggatatgaaa atgagagagactcacaagcaactcagttaa >gi568815580f:41855292_42167513|GENSCAN_predicted_peptide_3|913_aa MAHIGLLGAHPVNQGSPKGGASSWAELVTSSCPLQIVVPAVGGTFADGAMGEAEKFHYIY SCDLDINVQLKIGSLEGKREQKSYKAVLEDPMLKFSGLYQETCSDLYVTCQVFAEGKPLA LPVRTSYKAFSTRWNWNEWLKLPVKYPDLPRNAQVALTIWDVYGPGKAVPVGGTTVSLFG KYGMFRQGMHDLKVWPNVEADGSEPTKTPGRTSSTLSEDQMSRLAKMYIKIFVIVFILQL TKAHRQGHMVKVDWLDRLTFREIEMINESEKRSSNFMYLMVEFRCVKCDDKEYGIVYYEK DGDESSPILTSFELVKVPDPQMSMENLVESKHHKLARSLRSGPSDHDLKPNAATRDQLNI IVSYPPTKQLTYEEQDLVWKFRYYLTNQEKALTKFLKCVNWDLPQEAKQALELLGKWKPM DVEDSLELLSSHYTNPTVRRYAVARLRQADDEDLLMYLLQLVQALKYENFDDIKNGLEPT KKDSQSSVSENVSNSGINSAEIDRYVIVECEDQDTQQRDPKTHEMYLNVMRRFSQALLKG DKSVRVMRSLLAAQQTFVDRLVHLMKAVQRESGNRKKKNERLQALLGDNEKMNLSDVELI PLPLEPQVKIRGIIPETATLFKSALMPAQLFFKTEDGGKYPVIFKHGDDLRQDQLILQII SLMDKLLRKENLDLKLTPYKVLATSTKHGKLFHIDFGYILGRDPKPLPPPMKLNKEMVEG MGGTQSEQYQEFRKQCYTAFLHLRRYSNLILNLFSLMVDANIPDIALEPDKTVKKVQDKF RLDLSDEEAVHYMQSLIDESVHALFAAVVEQIHKFAQKQLSDGTATRGHWIKGAAFLNIE DTGTIGDGDDSVNAKGAILLAVSPLSQELRVKHDKETSAQVLRQLNQGYMGVDCCQSQLR KKAKCVFDAILPS >gi568815580f:41855292_42167513|GENSCAN_predicted_CDS_3|2742_bp atggcacacatagggctgctgggagctcaccctgtgaatcaagggtctcccaaaggaggg gcatcctcgtgggctgaactggtcaccagtagttgtccactacagatagtggttcccgct gtaggtggtacctttgcagacggtgcgatgggggaagcagagaagtttcactacatctat agttgtgacctggatatcaacgtccagcttaagataggaagcttggaagggaagagagaa caaaagagttataaagctgtcctggaagacccaatgttgaagttctcaggactatatcaa gagacatgctctgatctttatgttacttgtcaagtttttgcagaagggaagcctttggcc ttgccagtgagaacatcctacaaagcatttagtacaagatggaactggaatgaatggctg aaactaccagtaaaataccctgacctgcccaggaatgcccaagtggccctcaccatatgg gatgtgtatggtcccggaaaagcagtgcctgtaggaggaacaacggtttcgctctttgga aaatacggcatgtttcgccaagggatgcatgacttgaaagtctggcctaatgtagaagca gatggatcagaacccacaaaaactcctggcagaacaagtagcactctctcagaagatcag atgagccgtcttgccaagatgtatattaaaattttcgtcattgtatttattctgcagctc accaaagctcatcgacaaggacacatggtgaaagtagattggctggatagattgacattt agagaaatagaaatgataaatgagagtgaaaaacgaagttctaatttcatgtacctgatg gttgaatttcgatgtgtcaagtgtgatgataaggaatatggtattgtttattatgaaaag gacggtgatgaatcatctccaattttaacaagttttgaattagtgaaagttcctgacccc cagatgtctatggagaatttagttgagagcaaacaccacaagcttgcccggagtttaaga agtggaccttctgaccacgatctgaaacccaatgctgccacgagagatcagttaaatatt attgtgagttatccaccaaccaagcaacttacatatgaagaacaagatcttgtttggaag tttagatattatcttacgaatcaagaaaaagccttgacaaaattcttgaaatgtgttaat tgggatctacctcaagaggccaaacaggccttggaacttctgggaaaatggaagccgatg gatgtagaggactccttggagctgttatcctctcattacaccaacccaactgtgaggcgt tatgctgttgcccggttgcgacaggccgatgatgaggatttgttgatgtacctattacaa ttggtccaggctctcaaatatgaaaattttgatgatataaagaatggattggaacctacc aagaaggatagtcagagttcagtgtcagaaaatgtgtcaaattctggaataaattctgca gaaatagataggtatgtgatagtggaatgtgaagatcaagatactcagcagagagatcca aagacccatgagatgtacttgaacgtaatgagaagattcagccaagcattgttgaagggt gataagtctgtcagagttatgcgttctttgctggctgcacaacagacatttgtagatcgg ttggtgcatctaatgaaggcagtacaacgcgaaagtggaaatcgtaagaaaaagaatgag agactacaggcattgcttggagataatgaaaagatgaatttgtcagatgtggaacttatc ccgttgcctttagaaccccaagtgaaaattagaggaataattccggaaacagctacactg tttaaaagtgcccttatgcctgcacagttgttttttaagacggaagatggaggcaaatat ccagttatatttaagcatggagatgatttacgtcaagatcaacttattcttcaaatcatt tcactcatggacaagctgttacggaaagaaaatctggacttgaaattgacaccttataag gtgttagccaccagtacaaaacatggcaaactcttccacatagactttggatatattttg ggtcgggatccaaagcctcttcctccaccaatgaagctgaataaagaaatggtagaagga atggggggcacacagagtgagcagtaccaagagttccgtaaacagtgttacacggctttc ctccacctgcgaaggtattctaatctgattttgaacttgttttccttgatggttgatgca aacattccagatattgcacttgaaccagataaaactgtgaaaaaggttcaggataaattc cgcttagacctgtcggatgaagaggctgtgcattacatgcagagtctgattgatgagagt gtccatgctctttttgctgcagtggtggaacagattcacaagtttgcccagaagcagtta agtgacgggacagctactagaggacattggataaagggagcagcttttttaaacatagaa gataccggaaccataggtgatggtgatgattcagtaaatgcaaagggggctatcctgctg gccgtgtctcctctctcccaagagctacgtgtaaaacatgataaagaaacttctgcccaa gtgttaaggcagcttaaccagggttacatgggtgtagactgttgccaatctcaactacga aaaaaagcaaagtgtgtctttgatgcaattttgccttcttga >gi568815580f:41855292_42167513|GENSCAN_predicted_peptide_4|116_aa MVLYRENSMISDQKLLKLISNFRKVSGYKINVQKSQAFLYTNSRQAESQIMKVLPFTTAT MRINYLGIQLTREVNDLFKENYKPLLKETREDTNKWKNIPCSWIGRINYRENGHTA >gi568815580f:41855292_42167513|GENSCAN_predicted_CDS_4|351_bp atggtcctatatcgagaaaactccatgatttcagaccaaaagcttcttaagctgataagc aacttcagaaaagtctcaggatacaaaatcaatgtgcagaagtcacaagcattcctatac accaatagcagacaagcagagagccaaatcatgaaagtactcccattcacaactgctaca atgagaataaattacctaggaatacagctaacaagggaagtgaacgacctcttcaaggag aactacaaaccactgctcaaggaaaccagagaggacacaaacaaatggaaaaatattcca tgctcatggataggaagaattaattatcgtgaaaatggccatactgcatga >gi568815580f:41855292_42167513|GENSCAN_predicted_peptide_5|71_aa MVKAGLDRSICQSTNGRHKHQHQGRIQQACNRGGHNSTVYCEVLSTVLAVEVPILMQRRC SNLWPKTKIHA >gi568815580f:41855292_42167513|GENSCAN_predicted_CDS_5|216_bp atggtcaaggctggactggacaggtctatctgccagtccaccaatggcaggcacaagcac cagcaccaagggagaatccagcaggcctgcaacaggggagggcacaattccactgtctac tgtgaggtactttccacagttctggctgtggaggtgcctatcctgatgcagagaaggtgc tctaatctctggcccaagactaaaattcatgcatag >gi568815580f:41855292_42167513|GENSCAN_predicted_peptide_6|243_aa MAARHSSEAYRQNLKEYEAEVGESEGRLRTDHIGPHSGLRELSAAYPWRGSGEGFQTEQC HHVEQQQLGKEDCWHSLSMQIMTEDVHSLLKNLQNMPHELMKDDKYSKELEVLARAIRQE KEIKGIQLGKEEVELSLFADDMIAYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFL YTNNRQTESQIMSELSFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNETKEDTNKWKNI PCS >gi568815580f:41855292_42167513|GENSCAN_predicted_CDS_6|732_bp atggcagctcggcattccagtgaagcatacaggcagaacttaaaggaatatgaagcagaa gtgggagaaagtgaaggaagattgaggacagaccatattggaccccactctggtctccga gagctgtcagctgcttatccctggagagggagtggagagggattccagactgaacaatgc catcatgttgagcagcagcaattaggaaaagaagattgttggcattcactttcaatgcaa ataatgactgaagacgtgcactctctactgaagaatttacaaaacatgccacatgaatta atgaaagatgataagtacagcaaggagttagaagttctggccagggcaatcaggcaggag aaggaaataaagggtattcaattaggaaaagaggaagttgaattgtccctgtttgcagat gacatgattgcatatctagaaaaccccatcgtctcagcccaaaatctccttaagctaata agcaacttcagcaaagtctcaggatataaaatcaacgtgcaaaaatcacaagcattctta tacaccaataacagacaaactgagagccaaatcatgagtgaactctcattcacaattgct tcaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaag gagaactacaaaccactgctcaatgaaacaaaagaggacacaaacaaatggaagaacatt ccatgctcatag