GENSCAN 1.0 Date run: 6-Nov-116 Time: 05:40:09 Sequence gi568815597f:53359454_53566659 : 207206 bp : 50.33% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 4496 4535 40 -1.86 1.01 Init + 16663 16950 288 0 0 66 72 137 0.815 7.02 1.02 Intr + 17541 17683 143 2 2 85 99 15 0.642 1.45 1.03 Intr + 21651 21685 35 0 2 100 93 31 0.628 2.67 1.04 Term + 30138 30265 128 1 2 38 55 140 0.632 4.14 1.05 PlyA + 31943 31948 6 1.05 2.07 PlyA - 33912 33907 6 1.05 2.06 Term - 34497 34359 139 2 1 65 55 103 0.574 2.14 2.05 Intr - 36429 36363 67 1 1 69 94 42 0.187 0.86 2.04 Intr - 57888 57794 95 2 2 71 45 65 0.098 0.11 2.03 Intr - 67345 67229 117 0 0 65 109 56 0.587 4.98 2.02 Intr - 79267 78920 348 0 0 -8 56 455 0.175 27.17 2.01 Init - 79916 79375 542 2 2 27 17 932 0.843 74.35 2.00 Prom - 93427 93388 40 -1.86 3.00 Prom + 98471 98510 40 -6.06 3.01 Init + 99836 100577 742 1 1 63 111 784 0.449 73.11 3.02 Intr + 102020 102192 173 0 2 92 97 261 0.999 27.06 3.03 Intr + 105184 105394 211 0 1 109 105 182 0.986 20.49 3.04 Term + 107142 107209 68 1 2 121 47 58 0.934 3.10 3.05 PlyA + 108016 108021 6 1.05 4.06 PlyA - 108247 108242 6 1.05 4.05 Term - 123090 122902 189 0 0 73 48 171 0.792 9.05 4.04 Intr - 124343 124265 79 1 1 98 78 -3 0.682 -0.75 4.03 Intr - 126770 126650 121 0 1 41 99 64 0.511 2.35 4.02 Intr - 127271 127162 110 1 2 114 95 -16 0.163 1.63 4.01 Init - 133931 133822 110 2 2 66 77 66 0.211 2.99 4.00 Prom - 134805 134766 40 -6.96 5.19 PlyA - 135394 135389 6 1.05 5.18 Term - 147323 147166 158 0 2 90 49 218 0.625 16.20 5.17 Intr - 149834 149667 168 0 0 89 99 195 0.999 20.62 5.16 Intr - 150574 150396 179 0 2 130 121 55 0.998 12.56 5.15 Intr - 154465 154184 282 0 0 127 42 49 0.333 0.73 5.14 Intr - 155328 155172 157 1 1 100 71 92 0.948 7.77 5.13 Intr - 158801 158700 102 0 0 50 72 59 0.567 0.65 5.12 Intr - 161313 161181 133 0 1 108 98 269 0.999 30.12 5.11 Intr - 165434 165324 111 2 0 99 121 202 0.885 25.18 5.10 Intr - 165989 165888 102 2 0 75 61 50 0.586 1.37 5.09 Intr - 170492 170338 155 0 2 -11 78 247 0.762 13.59 5.08 Intr - 171956 171804 153 0 0 133 16 107 0.708 8.04 5.07 Intr - 174872 174822 51 0 0 55 115 34 0.117 1.68 5.06 Intr - 184342 184319 24 2 0 78 94 48 0.027 2.40 5.05 Intr - 186702 186651 52 0 1 50 72 61 0.001 -0.82 5.04 Intr - 193738 193643 96 1 0 50 75 65 0.032 1.61 5.03 Intr - 200262 200132 131 1 2 46 101 40 0.092 1.51 5.02 Intr - 200840 200762 79 2 1 86 93 53 0.241 4.72 5.01 Intr - 201090 200921 170 1 2 68 94 37 0.139 1.97 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:53359454_53566659|GENSCAN_predicted_peptide_1|197_aa MEEPEPLMMSSCQQPAGQGKQFCVGPLQDSQGTHTLAPSTAHQAAFLCSNRSSAVTIPEA RSLLIQIPPGQDFPLQLCCQMAPQLCPHTSRDTESRLLTKPAFKQGWEGERQESALAAAS PWGTKHCPSGKGREGAIQAEDEGRNKFTGSTETKNFLKAGKSKIEVLAGLVSGERLFLMD RNFLLCLHLVEGAKKAP >gi568815597f:53359454_53566659|GENSCAN_predicted_CDS_1|594_bp atggaggagcccgagcctttgatgatgagcagctgccagcagccggccgggcaagggaaa cagttctgcgttgggccactccaggactcccaggggacacacactcttgccccctccacc gcccaccaggcggccttcctctgctctaacagaagtagtgctgtgactattccagaggct cggagcctgctaatacagattccacccgggcaggacttcccactacagctttgctgccag atggcgccccagctgtgccctcacacctccagggacacggagagcaggctcctcactaaa cctgccttcaagcaaggatgggaaggagagcggcaggaaagcgccctggctgctgcctcc ccttggggcacaaagcactgcccctccgggaaggggcgagagggagcaatccaggcggag gatgaaggcaggaataagttcactggaagcacagaaacaaagaactttctgaaggctgga aagtccaagattgaggtgctggcaggtttggtgtctggcgagcgcctgttccttatggac cgcaacttcttgctgtgtcttcatttggtggaaggagcaaagaaggccccctga >gi568815597f:53359454_53566659|GENSCAN_predicted_peptide_2|435_aa MIPSRPPSAAGEKRGPRRQQTLGGGCGGGQRGVRLGPVPAAVRPGRDAELRLTHTAIVSL HLVKCRMQVDPGWYKGVLSGFGVTVHSDGLCGLARGCAQTFFGYSLQGFFKFGLYEVFKI RSAELLGPEKAYGWRAGLYLFAWASAEFFPDVSLEPMEAVKVRVQTRPGYASTLRATASR IGPLQVCRAQAPEPVHQGQAAGHHLRGVFCAVVSHPADSVVSVLNKEKGIAALGVLRRLG FTGVWKGVVARIGLFAHIPMIGTLTALQWFIYDLVKVYFKLPRPPVAQAPEPLKKKKASE AIGQCQSSAVKLRRSGKESVREPWARVPGALGVAVRPLPIHPMSVTELFYGAPSLSLETV AVLLLGSGTGDKGRAHSGGVAEPAADNYDRAPVRASSRVTHRSTLQIVFTYYDLSQATCE VSKHFNPILQGQAAP >gi568815597f:53359454_53566659|GENSCAN_predicted_CDS_2|1308_bp atgatcccctcgcggccgccgagcgctgcaggcgagaagcgtgggccaaggcgtcagcag accctgggcggcggctgtggaggaggccagcgtggagtacggctcggcccggtacctgct gctgtgcggcctgggcgggatgctgagctgcggctgacacacacggccatcgtgtccctg cacctggtcaagtgccgcatgcaggtggaccccggctggtacaagggcgtcctcagcggc ttcggcgtgacggtgcacagcgacgggctgtgcggcctggctcgcggctgcgctcagacc ttcttcggctactccctgcagggcttcttcaagttcggcctttacgaggtcttcaagatc cgctccgcggagctcctgggcccagagaaagcctacgggtggagagccggcctgtacctt ttcgcctgggccagcgccgagttcttccccgatgtctcgctggagcccatggaggcggtc aaggtccgcgtgcagacgcgaccgggctacgccagcaccctgcgagccacggcgtccagg ataggacctctacaggtatgccgtgcccaagccccagagccagtgcaccaaggccaagca gctgggcatcaccttcgtggcgtcttctgcgctgtcgtgtcccatcccgccgactccgtg gtgtctgtgctgaacaaggagaagggcatcgcggccctgggcgtcctccgaagactgggg ttcacgggcgtgtggaagggcgtcgttgcgcgcatcggcctctttgcgcacatccccatg atcggcaccctcacggccctgcagtggtttatctatgacctggtcaaggtctatttcaag ctgcctcgccctcccgtggctcaagcgcccgaacccctgaagaagaaaaaggcttccgag gcgatcgggcagtgtcagtcttcagctgttaagctgagaagatctgggaaggagtcagtc agagagccttgggccagagttccaggagctctgggagtggctgtcaggccattgcccatt caccccatgtctgtcacggagctcttttatggggcaccttcactgagccttgagactgtg gccgtcctgcttctaggctctggcactggggacaagggcagagcacactctggtggggta gcagagccagctgcagacaattatgacagggcacctgtgcgagcatcatctcgagtgaca cacagaagcaccttgcaaattgtgttcacctattacgatctgtcgcaggctacctgcgaa gtcagcaagcacttcaaccctattttacagggccaagcagcaccttga >gi568815597f:53359454_53566659|GENSCAN_predicted_peptide_3|397_aa MQGSCCSSRDTLRAALLQVYPMSPERGTCGARLIPAPLALQLPEVVVVLRRLTLPMADKM VRTPKCSRCRNHGFLVPVKGHAGKCRWKQCLCEKCYLISERQKIMAAQKVLKTQAAEEEQ EAALCAQGPKQASGAAAAAPAPVPVPAASLRPLSPGTPSGDADPGPEGRAAACFFEQPPR GRNPGPRALQPVLGGRSHVEPSERAAVAMPSLAGPPFGAEAAGSGYPGPLDLRRPMRTVP GPLFTDFVRPLNINPDRALGPEYPGGSSMHPYCPFPLGYLDAPPGVPLQQGFRHVSRSQY QGGGLVSEPGGDFQPSYYLPPPPPPLPPLPPLPPQPQFLPPGYLSALHFLPPPPPPPPPS SFSLTVLFDTDKENTDDQDAEVLSGEPSQPSSQEQSD >gi568815597f:53359454_53566659|GENSCAN_predicted_CDS_3|1194_bp atgcaaggttcctgctgcagttcccgggacacactgcgagccgctttgttacaagtgtat ccaatgtccccagaacgagggacttgtggggcgcggctcatcccagcgccacttgctctg cagctcccagaggtggtggttgtgttacgaaggctgaccctgccaatggccgacaaaatg gtgcgcacccccaagtgctcgagatgcaggaaccatggcttcctggtgcccgtcaaggga cacgcgggcaaatgccgctggaagcagtgcctctgcgagaagtgctacctgatctccgag cgccagaagatcatggccgcgcagaaggtgctcaagacgcaggccgccgaggaggagcag gaggcggccctgtgtgcgcaggggcccaagcaggcctccggggctgcggccgccgccccc gcccccgtccccgtcccggccgcgagcctccgcccgctgtccccggggactccctccgga gacgccgacccgggacccgagggccgcgcggccgcttgcttcttcgagcagcccccgcgg ggccggaaccccggcccgagagccctccagccggttctgggcggccgcagccacgtggag ccgagcgagcgagccgccgtggcgatgcccagccttgcgggacccccttttggggcggag gccgcaggcagtggctaccctggccccctagacctgcgcaggccgatgcggaccgtgccc ggcccactgttcaccgactttgtgcgccctctgaacatcaacccggaccgtgcactgggc cctgagtaccctggtggctccagcatgcacccctactgcccgttcccgctgggctacctg gacgcccctcctggcgtccccctgcagcagggcttccggcatgtgtcccgcagccagtac caaggcggaggcttggtgtcagaaccaggaggagacttccagccaagctactacctgccg ccgccgccgccgccactgccgccccttccaccgcttccaccgcagccccagttcctcccg ccaggctacctctctgcgctccacttcctccccccgccaccgccaccaccacctccatca tctttctcactgaccgtcctgtttgatactgacaaggagaacactgatgaccaggatgca gaggtactgtcgggtgagcccagccagccatcgtctcaggagcagtccgactag >gi568815597f:53359454_53566659|GENSCAN_predicted_peptide_4|202_aa MLSVMNHHGNASQNRSGMHFITAGMAIIKRSDTKHCCLASCFQAFVRAVPSVRSTLPSSF PGAMITLHSLLAVAPNPGPVWFLTGYSQQIEKKQPTLAPEPRVQPSLQRGFSGRGCSLTA AQERALLSFHSQRSQLREGERNLHAHMPEEEHENPRMPPEYPVAEKAAAHNTTPWVNVSN VIFSEEASPQRLHPKTAVQKTI >gi568815597f:53359454_53566659|GENSCAN_predicted_CDS_4|609_bp atgctcagcgtcatgaaccatcatggaaatgcaagtcaaaatcgcagtgggatgcacttc ataaccgctgggatggccatcatcaaacggtcagatactaagcactgttgccttgcctct tgcttccaggcctttgtccgtgctgttccttctgtccggagcactcttccttcaagcttc ccaggtgcaatgatcacattgcactcattgcttgctgtagctcctaaccctggtcccgtg tggtttctgactgggtactcgcagcaaatagagaagaaacagcccaccctagccccagag cccagagtgcagccttccctgcagcggggattttcaggcagaggctgcagcctcacagca gcccaggaaagggcactgctatcattccactcgcagagaagccagcttagagaaggggag cgtaacctgcatgcccacatgccagaggaagagcacgagaaccccaggatgcccccagag taccctgtagcagagaaggcagcagcacacaacaccacaccatgggtgaatgtgagcaac gtcatattcagtgaggaagcctcgccccaaagactacatcctaaaacagctgttcaaaag accatctag >gi568815597f:53359454_53566659|GENSCAN_predicted_peptide_5|767_aa XSTLPGRYEGLTGGHQGETDICSVCGPDISRPELSVCQWNPCPCKQGASSCLTREAQCWD RDESTQLLPPGLGKAGSSLSVFAGLRPGGAGRVGEDELQQRLAGAQDQALSRIAAAGLDP EGEQSRMVYPGFEAQCVLAKADGDGVHLHLPDEPFSASQELGGVTLRGGTHILQCIDPWD IRDAGPMESLRCPRVVVPLPEGEPQRKDVKSRLWERQSSGNTAATSSSLATCCWEPMLAP GSEGQGPMGPNLGCSKAFSRLENLKIHLRSHTGEKPYLCQHPGCQKAFSNSSDRAKHQRT HLDTIQPCFQPLTAIGRRTWFLVAPRAVRCLCTYLQGAKPYACQIPGCSKRYTDPSSLRK HVKAHSAKEQQVRKKLHAGPDTEADVLTECLVLQQLHTSTQLAASDGKGGCGLGQELLPV LLLCYTEHLLSARPGAGPVLELLPVQRGGQDQAGVYPGSITPHNGLASGLLPPAHDVPSR HHPLDATTSSHHHLSPLPMAESTRDGPGPSTFLLPCPSSSWLQPWPWPACAWGQPGFIAT FVAHCSYYYWAPPAIHSPIHSVFNYGAPVGRPWAQCWCLDDSGPGDAYCERETYHTGGQW LGPGLLSPIVSPLKGLGPPPLPPSSQSHSPGGQPFPTLPSKPSYPPFQSPPPPPLPSPQG YQGSFHSIQSCFPYGDCYRMAEPAAGGDGLVGETHGFNPLRPNGYHSLSTPLPATGYEAL AEASCPTALPQQPSEDVVSSGPEDCGFFPNGAFDHCLGHIPSIYTDT >gi568815597f:53359454_53566659|GENSCAN_predicted_CDS_5|2304_bp ncctccactctccctggaaggtacgaagggttaactggtgggcaccagggggaaacagat atttgctccgtgtgtggaccagatatctcccgaccggaactgtccgtctgccagtggaac ccctgcccttgcaaacagggagcgagctcctgcctgactcgtgaggcccagtgctgggac agagatgagtcaacacagctgctgcccccaggtctgggcaaagctggctcttctttgtct gtcttcgctgggctgaggccagggggtgcagggcgtgtgggggaggatgagctgcaacag aggctcgcaggggctcaggaccaggctctgtcccgcattgcagcggctgggctcgaccct gagggggagcagagccgcatggtttatcctggttttgaagcacaatgtgtgctcgccaag gctgatggggatggggttcatctacatcttccggacgagccattctcagccagccaggag cttggtggcgtcaccctgcgaggaggcacccacatccttcagtgcatcgacccctgggac atccgagatgcaggccccatggaatcattacgatgccctcgggtcgtggtgccattacct gaaggtgagccccagagaaaggatgtgaagagcaggctttgggagcgccagtcctcgggt aacacggctgctaccagctcatccttggccacctgctgctgggagcccatgctggcccct gggtcagaggggcagggtcccatgggccctaatctgggctgcagcaaggccttctcacgg ctggagaacctcaagatccacctgaggagccacacgggcgagaagccgtacctgtgccag cacccgggttgccagaaggccttcagcaactccagcgaccgcgccaagcaccagcgcacc cacctagacacgatccagccgtgcttccagcctctcacagctattggcaggaggacctgg tttcttgtggccccaagagcagtccggtgcctctgcacctacctgcaaggtgcgaagccg tacgcctgtcagatccctggctgctccaagcgctacacagaccccagctccctccgcaag cacgtcaaggcccattcagccaaagagcagcaggtgcgtaagaagctgcatgcgggccct gacaccgaggccgacgtcctgaccgagtgtctggtcctgcagcagctccacacgtccaca cagctggctgccagcgacggcaagggtggctgtggcctgggccaggagctgctcccagtg ctgctcctctgctatactgagcacctgctgtctgccaggcccggtgctggccctgtcctc gagctgcttccagtgcagcggggagggcaggaccaggcaggtgtgtatcctggctccatc accccccataacggacttgcatcgggcctcctgcccccagcgcacgacgtaccttccagg caccacccgctggatgccaccaccagttcccaccaccatctgtcccctctgcccatggct gagagcacccgggatgggcctggccccagcaccttcctccttccctgcccttcctcctcc tggctacagccttggccatggcctgcctgcgcctggggccagcctggcttcatcgccaca tttgtggcccactgctcctattattactgggctcctcctgccattcattcacccattcat tcggtgttcaattatggagcccctgtgggcaggccctgggcccagtgctggtgcctcgat gactctggccctggagatgcttactgtgagagggagacctaccacacagggggccagtgg ttggggcccggcctcctctcaccaatagtcagccccctgaaggggctggggccaccgccg ctgcccccatcctctcagagccattctccggggggccagcccttccccacactccccagc aagccgtcctacccacccttccagagccctccacccccgcctctgcccagcccacaaggt taccagggcagtttccactccatccagagttgcttcccctatggcgactgctaccggatg gctgaaccagcagccggtggggacggactggtcggggagacccacggtttcaaccccctg cggcccaatggctaccacagcctcagcacgcccttgcctgccacaggctatgaggccctg gctgaggcctcatgccccacagcgctgccacagcagccatctgaagatgtggtgtccagc ggccccgaggactgtggcttcttccccaatggagcctttgaccactgcctgggccacatc ccctccatctacacagacacctga