GENSCAN 1.0 Date run: 3-Nov-116 Time: 03:01:10 Sequence gi568815576f:44083330_44306423 : 223094 bp : 50.98% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 438 656 219 2 0 76 110 76 0.516 5.26 1.02 Term + 7011 7094 84 2 0 102 45 52 0.201 -0.05 1.03 PlyA + 7974 7979 6 -0.45 2.00 Prom + 8750 8789 40 -3.26 2.01 Init + 9440 9620 181 1 1 90 77 98 0.837 8.44 2.02 Intr + 10599 10688 90 1 0 91 60 161 0.985 13.57 2.03 Intr + 14429 14464 36 0 0 109 103 4 0.533 2.23 2.04 Intr + 16724 16794 71 0 2 108 116 119 0.542 15.60 2.05 Intr + 20191 20280 90 0 0 98 64 21 0.226 0.89 2.06 Intr + 21854 21940 87 1 0 91 115 24 0.379 5.57 2.07 Intr + 35709 35811 103 2 1 102 92 228 0.996 24.35 2.08 Intr + 46045 46149 105 2 0 48 56 87 0.011 1.69 2.09 Intr + 48158 48298 141 0 0 122 74 176 0.880 19.92 2.10 Intr + 49565 49680 116 0 2 110 100 167 0.998 20.27 2.11 Intr + 56795 56814 20 1 2 128 123 24 0.594 5.31 2.12 Intr + 62052 62095 44 0 2 156 75 -3 0.743 3.08 2.13 Intr + 64532 64593 62 0 2 76 105 171 0.995 16.05 2.14 Intr + 68154 68222 69 2 0 90 91 130 0.968 12.88 2.15 Intr + 69170 69283 114 1 0 44 82 44 0.342 0.04 2.16 Intr + 74653 74754 102 0 0 38 95 126 0.636 8.67 2.17 Intr + 80529 80601 73 2 1 139 90 115 0.613 15.78 2.18 Intr + 81317 81585 269 0 2 57 -11 113 0.192 -4.45 2.19 Intr + 84767 84944 178 1 1 44 131 62 0.423 5.59 2.20 Term + 85273 85349 77 2 2 103 49 102 0.958 5.80 2.21 PlyA + 85871 85876 6 1.05 3.00 Prom + 93585 93624 40 -4.16 3.01 Init + 94772 94782 11 1 2 96 74 10 0.146 0.42 3.02 Intr + 99913 100079 167 1 2 122 93 152 0.856 18.70 3.03 Intr + 102479 102543 65 0 2 84 87 78 0.995 5.74 3.04 Intr + 104447 104549 103 1 1 27 94 131 0.416 7.35 3.05 Intr + 105785 105925 141 0 0 105 51 295 0.808 27.82 3.06 Intr + 107222 107337 116 0 2 137 54 173 0.259 18.97 3.07 Intr + 113018 113086 69 1 0 61 91 128 0.998 9.78 3.08 Intr + 115292 115393 102 1 0 79 115 71 0.997 9.27 3.09 Intr + 122428 122500 73 0 1 48 113 171 0.950 14.58 3.10 Term + 122988 123097 110 1 2 136 46 122 0.998 11.47 3.11 PlyA + 123304 123309 6 1.05 4.00 Prom + 129749 129788 40 -4.76 4.01 Sngl + 130052 130768 717 1 0 20 42 396 0.802 24.43 4.02 PlyA + 131587 131592 6 1.05 5.03 PlyA - 131921 131916 6 1.05 5.02 Term - 141106 140901 206 2 2 74 46 90 0.644 0.93 5.01 Init - 142459 142384 76 1 1 66 55 86 0.421 4.35 5.00 Prom - 149267 149228 40 -3.56 6.00 Prom + 160392 160431 40 -3.16 6.01 Init + 162718 162769 52 0 1 70 80 87 0.064 5.44 6.02 Intr + 177036 177142 107 1 2 58 63 104 0.497 4.83 6.03 Intr + 178248 178419 172 2 1 103 86 12 0.751 2.02 6.04 Intr + 180580 180744 165 2 0 12 62 157 0.549 5.43 6.05 Intr + 186897 186979 83 1 2 58 99 68 0.300 4.26 6.06 Term + 193268 193390 123 1 0 38 55 111 0.262 1.18 6.07 PlyA + 195037 195042 6 1.05 7.08 PlyA - 195054 195049 6 -3.94 7.07 Term - 195685 195453 233 2 2 44 42 137 0.252 1.34 7.06 Intr - 196129 195987 143 0 2 70 -17 123 0.323 -0.00 7.05 Intr - 202416 202099 318 2 0 121 105 241 0.734 24.17 7.04 Intr - 213556 213343 214 2 1 73 89 293 0.051 25.67 7.03 Intr - 215600 215511 90 0 0 34 88 57 0.472 0.27 7.02 Intr - 216164 216084 81 0 0 89 100 33 0.888 4.21 7.01 Intr - 217648 217550 99 2 0 66 98 96 0.927 8.48 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 80342 80473 132 1 0 83 25 166 0.803 7.94 S.002 Intr + 112826 112884 59 0 2 51 75 87 0.845 2.30 S.003 Term + 210538 210758 221 2 2 97 48 160 0.962 10.10 S.004 Term - 213556 213339 218 2 2 73 48 306 0.947 22.41 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576f:44083330_44306423|GENSCAN_predicted_peptide_1|100_aa MKVLRFTLELVGAMGCTAAGSDQRQLCVLEGFLHQTGLSWFAKDRKPNSKWLKSKTEFTG TFDRMVCGVGWPQAIVHNQPLQNALLSSPWAVFVSTFLIY >gi568815576f:44083330_44306423|GENSCAN_predicted_CDS_1|303_bp atgaaggttttgcgttttaccctggagctggtgggagccatggggtgcactgcagcaggg agtgaccaacgccagctttgtgtattggaaggatttctacatcagacaggactctcttgg tttgcaaaggacagaaaacccaactcaaaatggcttaaaagcaaaacagaattcactggt acatttgacagaatggtctgtggggttggatggcctcaggccatcgtgcataaccagccc cttcagaatgcgctgctgagcagtccctgggcggtgttcgtgagcacgtttctcatttat taa >gi568815576f:44083330_44306423|GENSCAN_predicted_peptide_2|675_aa MSGALLACCSKAIMPHPLAVPSSAAVEAGCPELEVFKVQGDSGVKRGNTILLLNLFHGER VSDLQEEGKNAINSPMSPALVDVHPEDTQLGGVRNESYPTGTEENEERTMIDPTSKEDPK FKELVKGTWARIMVPKDDHVIIPGAIDYHLTWQRDMPLQQLADRVQPKAPSLNDIVTPWL ARGPQVLLDWINDVLVEERIIVKQLEEDLYDGQVLQKLLAAQTNESSVLRSSADSSSSTL GDVAMFSLRGRMVAEKLAGCKLNVAEVTQSEIGQKQKLQTVLEAVHDLLRPRGWALRWSV DSIHGKNLVAILHLLVSLAMHFRAPIRLPEHVTVQVVVVRDDDGPVRSGWSLGRPLGYPK TERDAFDTLFDHAPDKLSVVKKSLITFVNKHLNKLNLEVTELETQPNTGPLHTQFLCLER AGPFSLCSVLLSHCGQDLANVPLFADGVYLVLLMGLLEDYFVPLHHFYLTPESFDQKVHN VSFAFELMLDGGLKKPKARPEGGAHRTQTLWFSALGPHFDRGPLPSPAPSPESPMVLYEL GAHDASYTCCQHPLNGGLCRVSKIARKWILSLPSTHCSHRFLPVHAEGSWPMKALSPKSR KSLRGHRASEMNKVSDSQEGSCPHKESGPRLPGPQRELSGPLSVTGVFGPDVVNLDLKST LRVLYNLFTKYKNVE >gi568815576f:44083330_44306423|GENSCAN_predicted_CDS_2|2028_bp atgagtggggccctgttggcctgctgctccaaagccatcatgccacacccccttgctgtt ccatcatctgccgctgtggaggctggatgtccagagctagaggtgttcaaggttcaaggt gattctggtgttaaaagaggaaataccattttacttctgaacctgtttcatggagaaaga gtgagtgacctgcaggaagaaggcaagaatgccatcaactcaccgatgtcccccgccctg gtggatgttcaccctgaagacacccagcttggcggggtcaggaatgaaagctaccccacg gggacagaggagaacgaggagcgcacgatgattgaccccacttccaaggaagaccccaag ttcaaggaactggtcaagggaacctgggccagaatcatggtccccaaagatgaccatgtg ataatccctggagccattgactatcatcttacatggcaaagggacatgcccctccagcag ctggctgatagagtgcaacccaaagccccctccctgaatgacatagttactccctggctg gcccgagggccccaggtcctcctcgactggattaatgacgtgctggtggaggagaggatc attgtgaagcagctggaggaagacctgtatgacggccaggtgctgcagaagctcttggca gcccagacaaacgaatccagtgtcctccgctcctctgccgactcctcttcctccaccctg ggtgacgtggccatgttcagtctgcggggccggatggtggcagaaaaactggcagggtgc aagctgaatgtggctgaggtgacacagtccgaaatagggcagaaacagaagctgcagacg gtgctggaagcagtacatgacctgctgcggccccgaggctgggcgctccggtggagcgtg gactcaattcacgggaagaacctggtggccatcctccacctgctggtctctctggccatg cacttcagggcccccatccgccttcctgagcatgtaacggtgcaggtggtggtcgtgcgg gatgatgatgggccggttcggtcagggtggtccctggggaggcccctgggttaccccaag acagagcgggatgccttcgacacgctgttcgaccacgccccggataagctcagcgtggtg aagaagtctctcatcacttttgtgaacaagcacctgaacaagctgaatttggaggtgacg gaactggagacccagcccaacacagggcctttgcacacgcagttcctctgtctagaacgt gctggccccttcagcctctgctcagtgctgctgtcacactgtggccaggacctagcaaat gtccccctgtttgcagatggcgtgtacctggttctgctcatgggccttctggaagactac tttgttcctctccaccacttctacctgactccggaaagcttcgatcagaaggtccacaat gtgtccttcgcctttgagctgatgctggacggaggcctcaagaaacccaaggctcgtcct gaaggaggagcccacagaactcagacactgtggtttagtgccctgggcccccactttgac cgtggccccctacctagccctgctccctctcctgagtcccccatggtgctatatgagttg ggagcccatgatgcttcctacacctgttgccagcatccgctcaacggcgggctttgtaga gtctccaaaatagctagaaaatggatcctgtcccttccatctacccactgcagccaccgc ttcctgccagtccacgcggaaggttcctggccgatgaaggcactgagccccaaatcacgg aagtcacttagaggtcacagagcaagtgaaatgaacaaggtgtctgattcccaggaaggt tcttgtccacataaagaaagtggcccaagacttccagggccccagagggagctgtcgggt ccattaagtgttactggggtttttggtcccgacgtggttaacttggacctcaaatccacc ctgagggttctttacaacctgttcaccaagtacaagaacgtggagtga >gi568815576f:44083330_44306423|GENSCAN_predicted_peptide_3|318_aa MVELSRENEGLSEFGASQTVTPSPASSVQAWEAMEPEFLYDLLQLPKGVEPPAEEELSKG GKKKYLPPTSRKDPKFEELQKVLMEWINATLLPEHIVVRSLEEDMFDGLILHHLFQRLAA LKLEAEDIALTATSQKHKLTVVLEAVNRSLQLEEWQAKWSVESIFNKDLLSTLHLLVALA KRFQPDLSLPTNVQVEVITIEAIVNFVNQKLDRLGLSVQNLDTQFADGVILLLLIGQLEG FFLHLKEFYLTPNSPAEMLHNVTLALELLKDEGLLSCPVSPEDIVNKDAKSTLRVLYGLF CKHTQKAHRDRTPHGAPN >gi568815576f:44083330_44306423|GENSCAN_predicted_CDS_3|957_bp atggtggaactgtctagagaaaatgaggggctcagtgagtttggggcttctcagaccgtg acgccctctcctgcctcctctgtgcaggcttgggaggcgatggagccggagttcttgtac gacctgctgcagctccccaagggggtggagcccccagcggaggaggagctctcaaaagga ggaaagaagaaatacctgccacccacttcccggaaggaccccaaatttgaagaactgcag aaggtgttgatggagtggatcaatgccactcttctccccgagcacattgtggtccgcagc ctggaggaggacatgttcgacgggctcatcctacaccacctattccagaggctggcggcg ctcaagctggaagcagaggacatcgccctgacagccacaagccagaagcacaagctcaca gtggtgctggaggccgtgaaccggagtctgcagctggaggagtggcaggccaagtggagc gtggagagcatcttcaacaaggacctgttgtctaccctgcacctccttgtggccctggcc aagcgcttccagcccgacctctccctcccaaccaacgtccaggtggaggtcatcactatc gaggccatcgtgaactttgtcaaccagaagctggaccgcctgggcctgtctgtgcagaat ctggacacccagtttgcagatggggtcatcttactcttgctgattggacaacttgaaggc ttcttcctgcacttaaaggaattctacctcactcccaactctcctgcagaaatgctgcac aacgtcaccctggcgctggagctgctgaaggacgagggcctgctcagctgccctgtcagc cctgaagatatcgtgaacaaggatgccaagagcacactgagggtgctctatggtctgttc tgcaagcacacgcagaaggcacacagggacaggacgccccatggagccccgaattga >gi568815576f:44083330_44306423|GENSCAN_predicted_peptide_4|238_aa MTSWVSSSPSPPLSPSSLSHHHHLHHYHHHHHHCHHHHHHGIIITIIILITSTVTIITIT SSSPPSLSPSSPPLSPSSPSHHHHHHHPHRHLHCHHHHHDIITTIIINITIITTTVTIIT ITSSSPPSSSSSPPHHQHHHHHHHCHHHHHQIVITIIMTIITTITPLSPSSPSLLLSSPS HHHHYHHHHHYHHHCHHHHHCHHHHHCCHYHHHHIIIIIIIIIIITTGTKTVLEVPPP >gi568815576f:44083330_44306423|GENSCAN_predicted_CDS_4|717_bp atgacatcatgggtgtcatcatcaccatcaccaccactgtcaccatcatcactatcacat catcatcacctccatcattatcaccatcatcaccaccactgtcaccatcatcaccatcac ggcatcatcatcaccattatcatcctcatcacctccactgtcaccatcatcaccatcaca tcatcatcacctccatcattatcaccatcatcacctccactgtcaccatcatcaccatca catcatcatcatcaccatcatcctcatcgtcacctccactgtcaccatcatcaccatgac atcatcaccaccatcatcatcaacatcaccatcatcaccaccactgtcaccatcatcacc atcacatcatcatcacctccatcatcatcatcatcacctccacatcatcaacatcaccat catcaccaccactgtcaccatcatcatcatcagatcgtcatcaccatcattatgaccatc atcaccaccatcacaccactgtcaccatcatcaccatcactgttactatcatcaccatca catcatcatcactatcatcaccatcatcactaccatcatcattgtcaccatcaccatcat tgtcaccatcatcaccactgctgtcactatcatcaccatcacatcatcatcatcatcatc attatcatcatcatcactacaggcacaaaaacagtgctggaagtccctcctccatag >gi568815576f:44083330_44306423|GENSCAN_predicted_peptide_5|93_aa MRRSVLDVLDVSRGVFDIHVENLVGEESEPCASLRCGPIGSGTCSFKRTSQLQLDVPRSH ESPQKIIEWLANIQAKPPVREAARRDMNEECAG >gi568815576f:44083330_44306423|GENSCAN_predicted_CDS_5|282_bp atgaggagatcagtgttggacgtgttggatgtgagcagaggtgtctttgacatccacgtt gaaaacctagttggagaggagagcgagccctgtgcctcgctcagatgcggccccattggg agcggcacatgctcttttaaaagaacatcccaactgcagttggacgtcccacgttcccat gaatcaccccagaaaataattgagtggctggcaaatattcaagccaaacccccagtgcgg gaggcagcaagacgtgacatgaacgaggagtgtgcaggatga >gi568815576f:44083330_44306423|GENSCAN_predicted_peptide_6|233_aa MVMVRWLAWPLEQPGEAGREECDMDSFSAQDLVTIQSLALETLSAGADSAIRLAILTAAQ WGYRSCGHSPAHQPTVAPGMLPHCTAFWKESLANVNSCSALTANTPWSHTLYYSSYGTLT QYPREMCAFTSPRDVDVKVHSSFTDDGRKVETTQVSSKGDGETQCDLSPESPSLLDAVNV EMSHRLTHGFGEEVLLMEHVKQERKGGAKEMVGGSGDVDVPVPHVYGTSKSCP >gi568815576f:44083330_44306423|GENSCAN_predicted_CDS_6|702_bp atggtcatggtcaggtggctggcctggccgctggagcagccaggtgaggcgggaagggaa gaatgtgacatggactcattctcagcccaggacctggtgacaattcagtctctggccttg gagacgctgtctgcaggtgcggacagtgccattaggctggccatcctcactgctgcccag tggggctatcgctcctgtggacacagcccagctcatcagcccacagtagccccgggaatg cttcctcattgcacagccttctggaaggaatctttggcaaatgtcaactcttgttctgct ctcacggcaaacacgccttggtcacataccctttattacagcagctacgggacactcaca caatacccaagagagatgtgtgcattcacttcaccaagagacgtggatgtgaaggttcat agcagctttactgatgacggccggaaggtggaaacaacccaagtttcctcaaaaggtgat ggagaaacacagtgtgacttgagtcccgaatctccctccctcttggatgccgtcaacgtt gagatgtcccacagactgactcatggctttggggaggaggtattgctcatggaacacgtg aagcaggaaagaaagggaggagccaaggagatggtgggaggcagcggggatgtggatgtg ccggtcccccacgtctatgggacatccaagagctgcccttga >gi568815576f:44083330_44306423|GENSCAN_predicted_peptide_7|392_aa XSSGRINASQTMTSCGQQSLNVLAVLFSLLFSADASHALAHLVLTIPGGRTHSAHFTNEE GLCWLREGSRGGFIVTKWPKPMALTAQWWPVLSAHFRVCEPYTDHKGRYHFGFHCPRLSD NKTFILCCHHNNTVFKYCCNETEFQAVMQANLTASSEGYMHNNYTALLGVWIYGFFVLML LVLDLLYYSAMNYDICKVYLARWGIQGRWMKQDPRRWGNPARAPRPGQRAPQPQPPPGPL PQAPQAVHTLRGDAHSPPLMTFQSSSAWEGASQQQEIPENEETEKGDDQISSFLGVTSNT KEASVIGIQKTVDVLISVPALDSDGDESLPWTMMVMRGPEGTQWDCRESPPPTHALVPNC ADSPECRLQSQPLGTLLKPKPMLLLPPHPGCG >gi568815576f:44083330_44306423|GENSCAN_predicted_CDS_7|1179_bp ngcagctctggacggatcaatgcaagccagacgatgaccagttgtggccagcagtccttg aacgtgctcgccgtcctcttctcattgctgttttctgcagatgcttcccatgctttggct cacctggtcctcaccatccctggaggacggacccactcagcccattttacaaatgaggaa gggctctgttggctccgggaaggcagccggggaggcttcatcgttaccaaatggccaaag cccatggcgctgaccgcccagtggtggccggtcttgtctgcacatttccgggtctgtgaa ccatacacagaccacaaaggccgctaccactttggcttccactgcccccggctctcggac aacaagaccttcatcctctgttgtcaccataacaacacggtcttcaaatactgctgcaac gagacggagttccaggcggtgatgcaggcgaacctcacggccagctccgagggttacatg cacaacaattacaccgccttgttgggagtgtggatctatggatttttcgtgttgatgctg ctggttctggaccttttgtattactcggcaatgaactacgacatctgcaaggtctacctg gcacggtggggcatccaaggacgatggatgaaacaggacccccggcggtgggggaacccc gctcgggcccctcggccgggtcagcgggccccacagccgcagcctcccccaggcccgctg ccacaagccccacaggccgtgcacacattgcggggagatgctcacagcccaccgctgatg accttccagagttcgtctgcctgggagggtgccagccaacagcaagaaattccagaaaat gaggagactgaaaagggagatgaccaaatatcttctttccttggcgtaacatcaaatacc aaggaggcttctgtgattggaattcagaagacagttgatgtcctgatctcagtgcctgcc ctggacagtgatggtgatgagagcctgccctggacaatgatggtgatgagaggtcccgag ggtacacagtgggactgcagggaatctcctcctcctacccacgccctggtgcccaactgt gcagactccccagagtgcaggctacagtcccagcctctaggaaccctcctcaagccgaag cctatgctgctcctcccgccgcatcctggatgtggctga