GENSCAN 1.0	Date run: 7-Nov-116	Time: 02:27:02

Sequence gi568815592r:33473806_33677604 : 203799 bp : 50.49% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.09 PlyA -   3003   2998    6                               1.05
 1.08 Term -  17273  17067  207  2  0    4   54   127 0.227  -1.76
 1.07 Intr -  24915  24856   60  0  0   90   71    49 0.348   2.33
 1.06 Intr -  29002  28717  286  0  1   66   45   175 0.395   8.34
 1.05 Intr -  38702  38636   67  0  1  105   61     8 0.098  -2.24
 1.04 Intr -  45686  45560  127  2  1  121   92    10 0.031   4.95
 1.03 Intr -  59054  58870  185  0  2   58   35   100 0.031   1.21
 1.02 Intr -  59949  59863   87  1  0  106  101   -13 0.075   1.74
 1.01 Init -  65999  65930   70  2  1   52  110    48 0.361   4.62
 1.00 Prom -  70558  70519   40                              -3.56

 2.03 PlyA -  72988  72983    6                               1.05
 2.02 Term -  94021  94006   16  0  1  121   44     5 0.600  -2.89
 2.01 Init -  95570  95494   77  2  2   65  105    65 0.813   6.46
 2.00 Prom -  96134  96095   40                              -3.06

 3.11 PlyA -  96871  96866    6                               1.05
 3.10 Term - 100102  99998  105  1  0  141   41   135 0.905  12.51
 3.09 Intr - 100409 100229  181  1  1   68   92   228 0.999  21.07
 3.08 Intr - 101636 101493  144  1  0  115   80   302 0.991  31.50
 3.07 Intr - 102123 101988  136  1  1   71   99   102 0.990   9.23
 3.06 Intr - 110962 110902   61  1  1  120   94    24 0.001   4.51
 3.05 Intr - 112693 112559  135  1  0  132   68    58 0.002   8.96
 3.04 Intr - 113039 112959   81  2  0   66  102    12 0.001   0.23
 3.03 Intr - 114530 114379  152  0  2   57   59    72 0.001   1.08
 3.02 Intr - 119363 119259  105  0  0   89   76    63 0.127   5.39
 3.01 Init - 119797 119446  352  1  1   38   70   180 0.166   6.61
 3.00 Prom - 122320 122281   40                              -7.66

 4.00 Prom + 123388 123427   40                              -5.06
 4.01 Init + 147798 147886   89  2  2   93   84   175 0.941  17.71
 4.02 Intr + 166679 166749   71  2  2  122   83   167 0.997  18.43
 4.03 Intr + 181961 182082  122  0  2  102   91   237 0.994  25.61
 4.04 Intr + 184127 184213   87  1  0   97   94   163 0.999  17.97
 4.05 Intr + 184865 185023  159  1  0   61   64   304 0.991  25.48
 4.06 Intr + 185216 185314   99  1  0  106   78   212 0.994  22.31
 4.07 Intr + 185661 185744   84  2  0  104   74    96 0.983   9.82
 4.08 Intr + 188723 188869  147  1  0   85   55   241 0.775  20.93
 4.09 Intr + 189106 189201   96  0  0  106   83   142 0.994  15.71
 4.10 Intr + 189695 189745   51  1  0   80   77    33 0.582   0.50
 4.11 Intr + 189933 190075  143  2  2   36   80   205 0.978  13.65
 4.12 Intr + 191065 191164  100  1  1   77   74   257 0.923  23.31
 4.13 Intr + 191248 191408  161  0  2  100   80   453 0.999  44.49
 4.14 Intr + 192030 192171  142  0  1   80   96   322 0.999  32.56
 4.15 Intr + 193324 193485  162  0  0  107   85   326 0.999  34.37
 4.16 Intr + 193987 194159  173  0  2  120   80   466 0.999  47.74
 4.17 Intr + 194710 194829  120  1  0   84   46   203 0.999  15.31
 4.18 Intr + 195151 195351  201  1  0   63   49   340 0.561  26.00
 4.19 Intr + 196520 196771  252  2  0  114   27   522 0.999  45.25
 4.20 Intr + 196866 197010  145  0  1   67   24   319 0.995  23.68
 4.21 Intr + 197360 197501  142  1  1  132   65   320 0.984  34.03
 4.22 Intr + 198224 198423  200  0  2  129   51   280 0.998  27.47
 4.23 Intr + 199786 199915  130  0  1  127   74   213 0.999  24.07
 4.24 Intr + 200403 200460   58  1  1  121   60   100 0.999   8.44
 4.25 Intr + 201886 202051  166  1  1  101   66   361 0.980  35.16
 4.26 Intr + 202963 203127  165  0  0   84  107   337 0.998  35.36
 4.27 Intr + 203210 203284   75  1  0  132   69   139 0.995  16.11
 4.28 Intr + 203699 203788   90  1  0   82   40   123 0.987   7.09

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr + 112865 113004  140  0  2   93  101   159 0.872  17.88
S.002 Term + 149950 150061  112  1  1   87   41   121 0.955   5.33

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:33473806_33677604|GENSCAN_predicted_peptide_1|362_aa
MNGKGLKESLPAAQIDFARGITTGGTTGTCHCVRSWWVLGLTDLKNEAADPHDSGAQLAS
PSGSRTGAAGGAACQSCAVRSHSSALGWSMGLAALEQGVVLVREARAAQEPMEWRYEQNF
IFSNLNQSSCIGMDGLRERSWYQICRQLGTEVFTCMAHNTAGDGEGPTLGCSPGVTEDEA
SGAGRPLRVWGHTHLELALARDRLPRLSLHIFLQAEGAGSDLGQPREGLPQCGGGLKGSS
STVRVGVEAEQTPRASEGCQHDVTSHQHFGKPRQVHREAKGEEDEEDLYLVLEQDGNYKD
VKKALKGIVEWRENTTKALVETLLEKLRLQSTGWRKGLPHAQARASQQSAGKQETTPRVP
GE
>gi568815592r:33473806_33677604|GENSCAN_predicted_CDS_1|1089_bp
atgaatggaaaaggcttgaaggaaagcctgccagccgctcagattgactttgcaaggggg
atcaccacaggtgggactacaggcacatgccactgtgtccggagttggtgggttcttggt
ctgactgacttgaagaacgaagccgcggaccctcatgactcaggagcccagctggcttca
cccagtggatcccgcactggggctgcaggtggagctgcctgccagtcctgcgccgtgcgc
tcgcactcctcagcccttgggtggtcaatgggactggccgccctggagcagggggtggtg
ctcgtccgggaggctcgggcggcacaggagcccatggagtggcgttatgaacagaacttt
attttttccaacctaaaccaatcatcctgcattgggatggatggattgagggaaagatca
tggtaccagatctgcaggcagctggggacagaagtcttcacttgcatggcccataacact
gcgggggatggagaagggccgaccctcggatgctcacctggagtaacggaggatgaggcc
agtggcgctggccggccactccgagtgtggggccacacccacctggaactcgcgctggcc
cgtgatcgtctcccacgcctctccctccacatcttcctgcaagcagagggagccggctcc
gacctcggccagcccagagaggggctcccacagtgcggcggcgggctgaagggctcttca
agcacggtcagagtgggcgtcgaggccgagcagacaccaagagcgagcgagggctgccag
cacgatgtcacctctcaccagcactttgggaagccaaggcaggtacacagagaagcaaag
ggtgaagaagatgaagaggacctttatttagtgttagaacaggatggtaactataaagat
gtgaaaaaagctctaaagggtattgtggagtggagggagaatactaccaaggccctggtt
gagaccttattggagaagctgcggctgcagtccactggatggcggaaaggtctgccacat
gcccaggccagagctagtcagcaatcagcaggcaagcaggaaacaacccccagggtgcca
ggggaatga
>gi568815592r:33473806_33677604|GENSCAN_predicted_peptide_2|30_aa
MDVGRHQMDACESVNVRVAKCVCGICKAYS
>gi568815592r:33473806_33677604|GENSCAN_predicted_CDS_2|93_bp
atggacgtgggcaggcaccaaatggatgcatgtgagagtgtgaacgtgcgtgtggctaag
tgtgtatgtggcatctgtaaagcctattcctag
>gi568815592r:33473806_33677604|GENSCAN_predicted_peptide_3|483_aa
MVPSPGSLLDSSLLALLASFSALSKLYYSNIIIWFLTARVCYGPPTTDCEFLEARELPSA
QEAPRSAAGRPLRARPEGTEMWHGSAVLRCERLRRYARSAGAWGGLAVVAETCLLHFGLR
LFPDSRSQAAPLHPADDGDKVPHKPLIWGKGEEMTQASQRYSGNPRTVTPGTGQAGWGGD
EKKEEGPERADTKIQTLRCEKTKEGVGGLARAKRTWPSTMLTSTTVISGRDVEPARYHPS
FALPSPGSSLFEGLLWQRQHESLRLHPLGTACKGQVFFLQLQDLDKVAANPKAQSEEQVA
QDTEEVFRSYVFYRHQQEQEAEGVAAPADPEMVTLPLQPSSTMGQVGRQLAIIGDDINRR
YDSEFQTMLQHLQPTAENAYEYFTKIATSLFESGINWGRVVALLGFGYRLALHVYQHGLT
GFLGQVTRFVVDFMLHHCIARWIAQRGGWVAALNLGNGPILNVLVVLGVVLLGQFVVRRF
FKS
>gi568815592r:33473806_33677604|GENSCAN_predicted_CDS_3|1452_bp
atggtgccttctcctggaagccttctggattcctccctgctggcattgcttgcttccttc
tctgcactctccaaactatactacagcaacatcatcatctggttcttaactgcgcgggtc
tgttacggccctcccaccactgactgtgagtttctggaggccagggagctgccttcagca
caggaggccccgcgctccgccgccggtcgacccctgagggcgaggccagagggaacggaa
atgtggcacggttcggcagttctcagatgcgagcgcctgcgcaggtacgcacgctccgct
ggagcctggggtggcctggcagtcgtggccgagacgtgtttgctgcacttcggccttcga
ctcttcccggactccaggtcccaggccgccccgctccaccctgcggatgatggagacaaa
gtcccccacaagcccctcatatggggcaagggggaagaaatgacccaggcctctcagcgc
tactcaggcaaccccaggactgtgactcctgggacagggcaggctggctggggaggggat
gagaagaaggaggagggaccagaaagggcagataccaagattcaaacccttcgatgtgaa
aagacaaaagaaggggttggtggtctggcacgggccaagcgtacctggccatccacaatg
ctcacctccaccacggtgatctctggccgggacgtagagccagccaggtaccacccctcc
ttcgccctgccttccccaggaagctcactctttgagggccttctctggcagcggcagcat
gagagcctgcgtcttcacccactggggacagcatgcaaagggcaggtattcttccttcaa
ctgcaggatctggataaagtggctgctaatcccaaagcacagtcagaggagcaggtagcc
caggacacagaggaggttttccgcagctacgttttttaccgccatcagcaggaacaggag
gctgaaggggtggctgcccctgccgacccagagatggtcaccttacctctgcaacctagc
agcaccatggggcaggtgggacggcagctcgccatcatcggggacgacatcaaccgacgc
tatgactcagagttccagaccatgttgcagcacctgcagcccacggcagagaatgcctat
gagtacttcaccaagattgccaccagcctgtttgagagtggcatcaattggggccgtgtg
gtggctcttctgggcttcggctaccgtctggccctacacgtctaccagcatggcctgact
ggcttcctaggccaggtgacccgcttcgtggtcgacttcatgctgcatcactgcattgcc
cggtggattgcacagaggggtggctgggtggcagccctgaacttgggcaatggtcccatc
ctgaacgtgctggtggttctgggtgtggttctgttgggccagtttgtggtacgaagattc
ttcaaatcatga
>gi568815592r:33473806_33677604|GENSCAN_predicted_peptide_4|1210_aa
MSEMSSFLHIGDIVSLYAEGSVNGFISTLGLVDDRCVVEPAAGDLDNPPKKFRDCLFKVC
PMNRYSAQKQYWKAKQTKQDKEKIADVVLLQKLQHAAQMEQKQNDTENKKVHGDVVKYGS
VIQLLHMKSNKYLTVNKRLPALLEKNAMRVTLDATGNEGSWLFIQPFWKLRSNGDNVVVG
DKVILNPVNAGQPLHASNYELSDNAGCKEVNSVNCNTSWKINLFMQFRDHLEEVLKGGDV
VRLFHAEQEKFLTCDEYKGKLQVFLRTTLRQSATSATSSNALWEVEVVHHDPCRGGAGHW
NGLYRFKHLATGNYLAAEENPSYKGDASDPKAAGMGAQGRTGRRNAGEKIKYCLVAVPHG
NDIASLFELDPTTLQKTDSFVPRNSYVRLRHLCTNTWIQSTNVPIDIEEERPIRLMLGTC
PTKEDKEAFAIVSVPVSEIRDLDFANDASSMLASAVEKLNEGFISQNDRRFVIQLLEDLV
FFVSDVPNNGQNVLDIMVTKPNRERQKLMREQNILKQVFGILKAPFREKGGEGPLVRLEE
LSDQKNAPYQHMFRLCYRVLRHSQEDYRKNQEHIAKQFGMMQSQIGYDILAEDTITALLH
NNRKLLEKHITKTEVETFVSLVRKNREPRFLDYLSDLCVSNHIAIPVTQELICKCVLDPK
NSDILIRTELLVVCRLRPVKEMAQSHEYLSIEYSEEEVWLTWTDKNNEHHEKSVRQLAQE
ARAGNAHDENVLSYYRYQLKLFARMCLDRQYLAIDEISQQLGVDLIFLCMADEMLPFDLR
ASFCHLMLHVHVDRDPQELVTPVKFARLWTEIPTAITIKDYDSNLNASRDDKKNKFANTM
EFVEDYLNNVVSEAVPFANEEKNKLTFEVVSLAHNLIYFGFYSFSELLRLTRTLLGIIDC
VQGPPAMLQAYEDPGGKNVRRSIQGVGHMMSTMVLSRKQSVFSAPSLSAGASAAEPLDRS
KFEENEDIVVMETKLKILEILQFILNVRLDYRISYLLSVFKKEFVEVFPMQDSGADGTAP
AFDSTTANMNLDRIGEQAEAMFGVGKTSSMLEVDDEGGRMFLRVLIHLTMHDYAPLVSGA
LQLLFKHFSQRQEAMHTFKQVQLLISAQDVENYKVIKSELDRLRTMVEKSELWVDKKGSG
KGEEVEAGAAKDKKERPTDEEGFLHPPGEKSSENYQIVKGILERLNKMCGVGEQMRKKQQ
RLLKNMDAHK
>gi568815592r:33473806_33677604|GENSCAN_predicted_CDS_4|3630_bp
atgagtgaaatgtccagctttcttcacatcggggacatcgtctccctgtacgccgagggc
tccgtcaatggcttcatcagcactttggggctggtggatgaccgctgtgtggtggagccc
gcggccggggacctggacaacccccctaagaagttccgtgactgcctcttcaaggtgtgc
cccatgaaccgctactcggcccagaagcagtactggaaggccaagcagactaagcaggac
aaggagaagatcgctgatgtggtgttgctgcagaagctgcagcatgcggcgcagatggag
cagaagcaaaatgacacggagaacaagaaggtgcatggggatgtcgtgaagtatggcagt
gtgatccagctcctgcacatgaagagcaacaagtacctgacagtgaacaagcggcttccg
gccttgctggagaagaacgccatgcgggtgactctggatgccacaggcaacgagggttcc
tggctcttcatccagcccttctggaagctgcggagcaacggggacaacgtggtcgtgggg
gacaaggtgatcctgaatcctgtcaatgccgggcagcctctgcatgccagcaattacgag
ctcagcgacaacgccggctgcaaggaggtcaattctgtgaactgcaacaccagctggaag
atcaacctgtttatgcagtttcgggaccacctggaggaggtgttgaaagggggagacgtg
gtgcggctgttccatgcggagcaggagaagttcctgacgtgtgacgagtacaagggcaag
ctgcaggtgttcctgcgaactacactgcgccagtctgccacctcggccaccagctccaat
gctctctgggaggtggaggtggtccaccacgacccctgccgtggaggagctgggcactgg
aatggcttgtaccgcttcaagcacctggctacaggcaactacctggctgctgaggagaac
cccagttacaaaggtgatgcctcagatcccaaggcagcaggaatgggggcacagggccgc
acaggccgcaggaatgctggggagaagatcaagtactgcctggtggctgtgcctcatggc
aatgacatcgcctctctctttgagctggaccccaccaccttgcagaaaaccgactctttc
gtgccccggaactcgtacgtccggctgcggcacctctgcaccaacacgtggattcagagc
accaatgtgcccattgacatcgaggaggagcggcccatccggctcatgctgggcacctgc
cccaccaaggaggacaaggaggcctttgccatcgtgtcagtgcccgtgtctgagatccga
gacctggactttgccaatgacgccagctccatgctggccagtgccgtggagaaactcaac
gagggcttcatcagccagaatgaccgcaggtttgtcatccagctgctggaagacctggtg
ttctttgtcagcgatgtccccaacaatgggcagaatgtcctggacatcatggtcactaag
cccaaccgggaacggcagaagctgatgagggagcagaacatcctcaaacaggtctttggc
attctgaaggccccgttccgtgagaaggggggtgaaggtcccctggtgcggctggaggag
ctgtcagaccagaagaacgccccctaccagcacatgttccgcctgtgctaccgcgtgttg
cggcattcccaggaggactaccgcaagaaccaggagcacattgccaagcagtttgggatg
atgcagtcccagattggctacgacatcctggccgaggacaccatcactgccctgctgcac
aacaaccgcaagctcctggaaaagcacatcaccaagaccgaggtggagaccttcgtcagc
cttgtgcgcaagaaccgggagcccaggttcctggactacctctctgacctgtgtgtgtcc
aaccacatcgccatccccgtcacccaagagctcatctgcaagtgtgtgctggaccccaag
aacagtgacattctcatccggaccgagttgctggtggtctgcaggcttcggcccgtgaag
gagatggcccaatcccacgagtacctgagcatcgagtactcagaagaggaagtgtggctc
acgtggactgacaagaataacgagcatcatgagaagagtgtgaggcagctggcccaggag
gcgcgggccggcaacgcccacgacgagaatgtgctcagctactacaggtaccagctgaag
ctctttgcccgcatgtgcttggaccgccagtacttggccatcgacgagatctcccagcag
ctgggcgtggacctgattttcctgtgcatggcagacgagatgctgccctttgacctgcgc
gcctccttctgccacctgatgctgcacgtgcacgtggaccgtgacccccaggagctggtc
acgccggtcaagtttgcccgtctctggactgagatccccacagccatcaccatcaaggac
tatgattccaacctcaacgcgtcccgagatgacaagaagaacaagtttgccaacaccatg
gagttcgtggaggactacctcaacaatgtagtcagcgaggccgtgccctttgccaacgag
gagaagaacaagctcacttttgaggtggtcagcctggcgcacaatctcatctacttcggc
ttctacagcttcagcgagctgctgcggctcactcgcacactgctgggcatcatcgactgt
gtgcaggggcccccggccatgctgcaggcctatgaggaccccggtggcaagaatgtgcgg
cggtccatccagggcgtggggcacatgatgtccaccatggtgctgagccgcaagcagtcc
gtcttcagtgcccccagcctgtctgctggggccagtgctgctgagccgctggacagaagc
aagtttgaggagaatgaggacattgtggtgatggagaccaagctgaagatcctggaaatc
cttcagttcatcctcaatgtccgcctggattaccgcatatcctacctgctgtctgtcttc
aagaaggagtttgtggaggtgtttcccatgcaggacagtggggctgatggcacagcccct
gccttcgactctaccactgccaacatgaacctggatcgcatcggggagcaggcggaggcc
atgtttggagtggggaagacaagcagcatgctggaggtggatgacgagggcggccgcatg
ttcctgcgcgtgctcatccacctcaccatgcacgactatgcgccgctggtctcgggtgcc
ctgcagctgctcttcaagcacttcagccagcgccaggaggccatgcacaccttcaagcag
gttcagctgctgatctcagcgcaggacgtggagaactacaaggtgatcaagtcggagctg
gaccggctgcggaccatggtggagaagtcagagctgtgggtggacaagaagggcagtggc
aagggtgaggaggtggaggcaggcgccgccaaggacaagaaagagcgtcccacggacgag
gagggctttctgcacccaccaggggagaaaagcagtgagaactaccagatcgtcaagggc
atcctggaaaggctgaacaagatgtgcggggttggggagcaaatgaggaagaagcagcaa
cggctgctgaagaacatggatgcccacaag