GENSCAN 1.0 Date run: 6-Nov-116 Time: 06:12:57 Sequence gi568815580f:75185543_75388638 : 203096 bp : 44.57% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3442 3518 77 0 2 80 37 64 0.076 1.06 1.02 Term + 14552 14678 127 2 1 28 48 182 0.181 5.96 1.03 PlyA + 15380 15385 6 -0.45 2.05 PlyA - 15682 15677 6 1.05 2.04 Term - 16798 15874 925 0 1 96 42 735 0.954 61.35 2.03 Intr - 22397 22324 74 2 2 73 83 49 0.741 1.10 2.02 Intr - 22818 22773 46 2 1 108 45 44 0.298 0.51 2.01 Init - 23516 23308 209 2 2 70 105 413 0.980 39.49 2.00 Prom - 24627 24588 40 -3.66 3.03 PlyA - 24690 24685 6 1.05 3.02 Term - 35295 35144 152 1 2 36 44 113 0.527 -0.23 3.01 Init - 39935 39800 136 2 1 81 48 103 0.684 5.92 3.00 Prom - 48356 48317 40 -5.66 4.08 PlyA - 48692 48687 6 1.05 4.07 Term - 55958 55810 149 0 2 81 43 119 0.725 4.76 4.06 Intr - 59249 59037 213 0 0 28 -16 227 0.199 4.99 4.05 Intr - 72710 72565 146 1 2 29 94 85 0.060 3.13 4.04 Intr - 75589 75518 72 0 0 24 66 135 0.060 3.42 4.03 Intr - 80586 80505 82 1 1 64 105 49 0.047 2.90 4.02 Intr - 84389 84305 85 2 1 12 70 72 0.024 -2.81 4.01 Init - 87585 87445 141 0 0 69 93 115 0.582 10.23 4.00 Prom - 89482 89443 40 -4.66 5.00 Prom + 93834 93873 40 -1.96 5.01 Init + 95986 96078 93 0 0 62 100 68 0.489 5.30 5.02 Intr + 97019 97231 213 1 0 38 92 66 0.593 0.91 5.03 Intr + 97994 98066 73 1 1 50 85 57 0.907 0.58 5.04 Term + 99906 103099 3194 1 2 85 40 5060 0.993 486.32 5.05 PlyA + 103453 103458 6 1.05 6.00 Prom + 104001 104040 40 -8.26 6.01 Init + 106077 106155 79 2 1 85 75 87 0.994 8.32 6.02 Intr + 111062 111130 69 0 0 75 110 36 0.375 3.65 6.03 Intr + 119148 119264 117 1 0 97 43 140 0.392 10.84 6.04 Intr + 124790 124965 176 0 2 64 46 85 0.249 1.56 6.05 Term + 128777 128917 141 1 0 87 32 94 0.283 1.63 6.06 PlyA + 129968 129973 6 1.05 7.06 PlyA - 130197 130192 6 1.05 7.05 Term - 131537 131414 124 1 1 57 43 75 0.154 -2.24 7.04 Intr - 134967 134806 162 2 0 79 73 75 0.331 4.19 7.03 Intr - 136081 135975 107 1 2 100 94 -5 0.304 0.31 7.02 Intr - 145776 145647 130 2 1 72 -8 134 0.051 2.90 7.01 Init - 155966 155794 173 2 2 84 75 130 0.745 10.22 7.00 Prom - 161175 161136 40 -0.96 8.05 PlyA - 162084 162079 6 1.05 8.04 Term - 173645 173494 152 0 2 74 37 103 0.129 1.87 8.03 Intr - 182546 182490 57 0 0 44 82 67 0.406 0.56 8.02 Intr - 183230 183056 175 2 1 78 26 138 0.680 6.11 8.01 Init - 199432 199292 141 1 0 90 72 88 0.109 7.53 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 185252 185088 165 2 0 65 50 105 0.807 4.09 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815580f:75185543_75388638|GENSCAN_predicted_peptide_1|67_aa MGKTLSQTKEPWIRQGGKLLILKEIVLDTPKPKTLKNNLVGFTFNQNGPMGFQRNGDSGS SLQGTKG >gi568815580f:75185543_75388638|GENSCAN_predicted_CDS_1|204_bp atggggaaaacattatcacaaaccaaggaaccgtggattcggcagggtggaaaactcctc atcctcaaagaaatagtgttagatacaccgaaacctaagactttgaaaaataatctggtg ggcttcaccttcaatcagaacgggcccatggggtttcagcgcaacggtgacagcggcagc agtcttcagggcactaaaggttaa >gi568815580f:75185543_75388638|GENSCAN_predicted_peptide_2|417_aa MLRLVPTGARAIVDMSYARHFLDFQGSAIPQAMQKLVVTRLSPNFREAVTLSRDCPVPLP GDGDLLVRNRLAAASQWRSLSLDVWRPAKSHRNVYGAACVKLNNNIEREEFVGVNASDIN YSAGRYDPSVKPPFDIGFEGIGEVVALGLSASARYTVGQAVAYMAPGSFAEYTVVPASIA TPVPSVKPEYLTLLVSGTTAYISLKELGGLSEGKKVLVTAAAGGTGQFAMQLSKKAKCHV IGTCSSDEKSAFLKSLGCDRPINYKTEPVGTVLKQEYPEGVDVVYESVGGAMFDLAVDAL ATKGRLIVIGFISGYQTPTGLSPVKAGTLPAKLLKKSASVQGFFLNHYLSKYQAAMSHLL EMCVSGDLVCEVDLGDLSPEGRFTGLESIFRAVNYMYMGKNTGKIVVELPHSVNSKL >gi568815580f:75185543_75388638|GENSCAN_predicted_CDS_2|1254_bp atgctgcggctggtgcccaccggggcccgggccatcgtggacatgtcgtacgcccgccac ttcctggacttccagggctccgccattccccaagccatgcagaagctggtggtgacccgg ctgagccccaacttccgcgaggccgtcaccctgagccgggactgcccggtgccgctcccc ggggacggagacctcctcgtccggaaccggttggcagccgcctcgcagtggcggagcctg tcgttggatgtgtggaggcctgcaaagtcccatcgaaatgtttacggtgctgcatgtgtc aagttgaataataatatcgaaagggaggaatttgttggtgttaacgcatctgacatcaac tattcagcaggccgctatgacccctcagttaagcctccctttgacataggtttcgaaggc attggggaggtggtggccctaggcctctctgctagtgccagatacacagttggccaagct gtggcttacatggcacctggttcttttgctgagtacacagttgtgcctgccagcattgca actccagtgccctcagtgaaacccgagtatcttaccctgctggtaagtggcaccaccgca tacatcagcctgaaagagctcggaggactgtcggaagggaaaaaagttttggtgacagca gcagctgggggaacgggccagtttgccatgcagctttcaaagaaggcaaagtgccatgta attggaacctgctcttctgatgaaaagtctgcttttctgaaatctcttggctgtgatcgt cctatcaactataaaactgaacccgtaggtaccgtccttaagcaggagtaccctgaaggt gtcgatgtggtctatgaatctgttgggggagccatgtttgacttggctgtagacgccctg gctacgaaagggcgcttgatagtaatagggtttatctctggctaccaaactcctactggc ctttcgcctgtgaaagcaggaacattgccagccaaactgctcaagaaatctgccagcgta cagggcttcttcctgaaccattacctttctaagtatcaagcagccatgagccacttgctc gagatgtgtgtgagcggagacctggtttgtgaggtggaccttggagatctgtctccagag ggcaggtttactggcctggagtccatattccgtgctgtcaattatatgtacatgggaaaa aacactggaaaaattgtagttgaattacctcactctgtcaacagtaagctgtaa >gi568815580f:75185543_75388638|GENSCAN_predicted_peptide_3|95_aa MGQLTVRRKRRGQAITLTHGQKRSHLQLSRELHAMLTYRQASLTNKFQKDTMDIERKKCL SSKKTAYCCSSVVDEQQKQIQKREPGKHEKVKQAA >gi568815580f:75185543_75388638|GENSCAN_predicted_CDS_3|288_bp atgggccagctcacagtcagaagaaagcggcgtggacaggccatcacgctgacacacgga cagaagcgatcacatctccaactctcccgcgagctccacgccatgctcacctaccgccag gcctcactcacaaacaaatttcagaaagacacaatggacatagaaagaaaaaaatgtcta tcatctaagaaaactgcttactgctgcagcagtgtggtggatgaacaacaaaagcaaatc cagaaacgagagcctggaaaacatgaaaaggttaaacaagcggcttag >gi568815580f:75185543_75388638|GENSCAN_predicted_peptide_4|295_aa MLCFLKVSFKSMLKNLADTPNTHTHYGTSLYPISSSSVAKYPDEKQKDCHLWDGYKALSE SVSSHVDSGFITETEVNLVQRDSIDHSPGDGDLCGLGSGHQKRSCSSGYIAELDFSVSVG GDVANETEHPDSGYHVHSINTSNKTRMVAVEEHRGRVPVLAFTHCSDAADTESRSVITGR LTGNLIPQETRGTEAQRCTEPAEHSIPKALYASKKQGRTQDWQSNDHKERLTIPGFRPNR TGESRPGTDAHPANPRPVTPYAEGETHLPPSAMQQVATQMAQESQAPASQSYGAI >gi568815580f:75185543_75388638|GENSCAN_predicted_CDS_4|888_bp atgctctgctttctaaaagtcagcttcaaatctatgctgaaaaatctagcagacacccca aatacccacacgcactatggaacatccctgtaccccatctcttcctcctcggtggcaaaa tatcctgatgaaaagcagaaggactgccatttatgggatggctacaaagccctgtcggaa tctgtttccagccacgtcgactcaggcttcatcacagagacagaagtaaaccttgtccaa cgggacagcatcgatcactcacccggggatggggacctctgcggtctaggctcaggacac caaaaaaggtcctgctcctccgggtacatcgcggaactggacttctctgtctccgtggga ggagatgtggccaatgaaacagagcaccctgacagtgggtaccacgtgcactccatcaac acatccaacaaaacccgcatggtggctgtggaggagcacagaggaagggtccctgtactg gccttcacacactgcagtgacgctgcagacaccgagtcacgctcagtcataacagggaga ctcaccgggaacctcatcccccaggaaacacggggcactgaagctcaacgctgcactgag cctgcagaacacagcatacccaaagcgctttatgccagcaaaaagcaaggacggactcaa gattggcagagcaacgaccacaaagaaaggctcaccatcccggggttccggcccaaccgc actggagaaagccggcctggcacagacgcacatcccgccaacccccggcccgtgactcct tatgcggagggggagactcacctgcccccatcagcgatgcagcaagtagccacccagatg gcacaggagagccaggccccagctagccagtcctatggtgccatctga >gi568815580f:75185543_75388638|GENSCAN_predicted_peptide_5|1190_aa MTGVLPDGPVSGRCAGVLMAVVGTGPGVVEKGRRPTEVALKEVEELSLSTCWKEEPFRER AAILKPGRALTRNQTLPDLNLGLPVSRTGRKEVLLVKPYPHTYSVNRIVIPGFAAFPHTS MKCVNTAYVPEEELKAAEIDEEHVEDDGLSLDIQESEYMCNEETEIKEAQSYQNSPVSSA TNQDAGYGSPFSESSDQLAHFKGSSSREEKEDPQCPDSVSYPQDSLAQIKAVYANLFSES CWSSLALDLKKSGSTTSTNDASQKESSAPTPTPPTCPVSTTGPTTSTPSTSCSSSTSHSS TTSTSSSSGYDWHQAALAKTLQQTSSYGLLPEPSLFSTVQLYRQNNKLYGSVFTGASKFR CKDCSAAYDTLVELTVHMNETGHYRDDNRDKDSEKTKRWSKPRKRSLMEMEGKEDAQKVL KCMYCGHSFESLQDLSVHMIKTKHYQKVPLKEPVPAITKLVPSTKKRALQDLAPPCSPEP AGMAAEVALSESAKDQKAANPYVTPNNRYGYQNGASYTWQFEARKAQILKCMECGSSHDT LQQLTAHMMVTGHFLKVTTSASKKGKQLVLDPVVEEKIQSIPLPPTTHTRLPASSIKKQP DSPAGSTTSEEKKEPEKEKPPVAGDAEKIKEESEDSLEKFEPSTLYPYLREEDLDDSPKG GLDILKSLENTVSTAISKAQNGAPSWGGYPSIHAAYQLPGTVKPLPAAVQSVQVQPSYAG GVKSLSSAEHNALLHSPGSLTPPPHKSNVSAMEELVEKVTGKVNIKKEERPPEKEKSSLA KAASPIAKENKDFPKTEEVSGKPQKKGPEAETGKAKKEGPLDVHTPNGTEPLKAKVTNGC NNLGIIMDHSPEPSFINPLSALQSIMNTHLGKVSKPVSPSLDPLAMLYKISNSMLDKPVY PATPVKQADAIDRYYYENSDQPIDLTKSKNKPLVSSVADSVASPLRESALMDISDMVKNL TGRLTPKSSTPSTVSEKSDADGSSFEEALDELSPVHKRKGRQSNWNPQHLLILQAQFASS LRETTEGKYIMSDLGPQERVHISKFTGLSMTTISHWLANVKYQLRRTGGTKFLKNLDTGH PVFFCNDCASQFRTASTYISHLETHLGFSLKDLSKLPLNQIQEQQNVSKVLTNKTLGPLG ATEEDLGSTFQCKLCNRTFASKHAVKLHLSKTHGKSPEDHLIYVTELEKQ >gi568815580f:75185543_75388638|GENSCAN_predicted_CDS_5|3573_bp atgacaggcgtcctccctgatgggcctgtgtctggcaggtgtgcaggggtgctcatggct gtggtcggcacaggccctggggttgtggagaagggccgtcgtccaacagaggtggcctta aaagaagtggaagagctctctctttccacctgctggaaggaagagccattcagggagaga gctgccatcctcaagccagggagagccctcaccagaaaccaaaccctgccagaccttaac ctgggacttccagtctccaggactgggagaaaggaagttctgttggtcaagccatatccg catacgtacagtgtgaacaggattgtgattcctggctttgctgcttttccacatacctcc atgaaatgcgtgaacacagcttatgttcctgaggaagaattgaaggcagcagaaatagat gaagagcacgtggaggatgacgggctgtctttggacattcaggaaagtgagtacatgtgc aatgaagagacggagatcaaagaggcgcagagctaccagaactccccagtcagctctgcg actaaccaggacgccggctacgggtcgcccttcagtgagagcagcgaccagctagcccat ttcaaaggctcttcctctcgagaagagaaggaggatccgcagtgtcccgacagcgtctcg tacccccaggacagcctggcacagatcaaagctgtgtatgcaaacttgttctccgagtcc tgctggtccagcttagctctggatttaaagaagtcgggttccaccaccagcaccaacgat gccagccagaaggagagctccgcccccacccccacaccccccacctgccccgtcagcacc actggccccaccacgagcacgcccagcaccagctgcagctccagcaccagccacagcagt accaccagtaccagcagcagctccgggtacgactggcaccaggctgcactggccaagacg ctgcagcagacgtcctcgtatgggctgcttcctgagcccagcctgttcagcaccgtgcag ctctaccgccagaacaacaagctctacggctccgtcttcacgggcgccagcaagttccgg tgcaaagactgcagtgccgcgtacgacacgctggtggaactgacggtgcacatgaacgag acaggccactaccgtgacgacaacagggacaaggactccgagaagaccaagaggtggtcc aagcccaggaagcgctccctgatggagatggaggggaaggaggatgcccagaaggtgctg aagtgcatgtactgtggacactcctttgagtccttgcaggacctcagcgtccacatgatc aaaaccaagcattaccagaaagtgcctctgaaggagccagtgccagccatcaccaaactg gtcccctccaccaaaaagcgggcgcttcaggacctggcgcccccctgctcccctgagcca gcaggaatggccgcagaggtggccctgagtgagtcagccaaggatcagaaagcagcgaac ccgtacgtcacgcccaataaccgctatggctaccagaatggcgccagctacacctggcag tttgaggcccgcaaggcgcagatcctcaagtgcatggagtgtggcagctcccacgacacg ctgcagcagctcaccgcccacatgatggtcaccgggcacttcctgaaagtgaccacctcg gcttctaagaagggcaagcagttggtgctggaccctgtggtggaagagaagatccagtcc atcccactaccgcccaccacccacacgcggctgccggcctccagcatcaaaaagcagccc gactctcccgcggggtccacgacttctgaagaaaagaaagagccagagaaggagaagccg cctgtggctggcgacgcggagaagatcaaggaggagagtgaggacagcttggagaaattt gagcccagcaccctgtacccgtacctgcgtgaggaggacctggacgacagccccaaggga gggctggacattctcaagtccctggagaataccgtctccacggccattagcaaagctcag aatggtgcgccctcatggggtggctaccccagcatccatgcagcctaccagctcccgggc accgtgaagccactgccggcggccgtgcagagcgtgcaggtgcagccgtcctatgctggc ggcgtgaagtcgctgtcttccgccgagcacaacgccctcctgcactccccagggagcctc acgcccccaccgcacaagagcaacgtgtctgccatggaggagctggtggagaaggtcacg ggcaaggtcaacatcaagaaggaggagagaccccctgagaaggagaagagctccctggcc aaggctgcgtcccccatagcaaaagagaataaagatttcccgaaaacggaggaagtcagc ggcaaaccacagaagaagggccctgaggccgagactgggaaggccaaaaaggagggaccg ctggacgttcacaccccaaatggcacagagcctctcaaagcaaaggtcaccaacggctgt aacaacctggggatcatcatggaccactcaccggagccttccttcatcaacccgctgagc gctttgcagtccatcatgaacacccacctgggcaaggtgtccaagcccgtgagtccctcg ctggacccgctggcgatgctgtacaagatcagcaacagcatgctggacaagccggtgtac cccgccacccctgtgaagcaggccgatgccatcgaccgctactattatgaaaacagcgac cagcccattgacttaaccaagtccaagaacaagccgctggtgtccagcgtggctgattcg gtggcatcacctctgcgggagagcgcactcatggacatctccgacatggtgaaaaacctc acaggccgcctgacgcccaagtcctccacgccctccacagtttcagagaagtccgatgct gatggcagcagctttgaggaggcgttggacgagctgtcaccggtccacaagaggaagggc cggcagtccaactggaacccgcagcaccttctcatcctgcaggcccagttcgcctcgagc ttgcgggagaccacagagggcaagtacatcatgtcggacttgggcccgcaggagagggtg cacatctcgaagtttactgggctctccatgaccaccatcagccactggctggccaatgtg aagtaccagttgaggaggacagggggaacgaaattcctaaagaacctggacacagggcat cctgttttcttttgcaacgattgtgcctctcagttcagaactgcttctacatacataagt catttggagacacacttgggcttcagcctgaaggatctctccaagctgccactcaatcag attcaagaacagcagaatgtttcgaaagtcctcaccaacaaaactctgggcccactgggg gccaccgaggaagacttgggctccacattccaatgtaagctctgcaaccggacttttgcg agcaagcacgcagtcaaactgcaccttagtaagacccacggcaagtctcccgaggaccac ctgatctatgtgactgagttggagaaacagtag >gi568815580f:75185543_75388638|GENSCAN_predicted_peptide_6|193_aa MGFEPKKLLLQNHTLNTSLQALGSAPERFMLAKRFSIVEKAGEQNHGRAAVSTIRYNSEV AAKYVPSVLINWTAIEELLISANRHVLPAVRLVLYKPKSDHIGCQSKALTKAFQARVTRT FAFSLAPPPPTLSSQQCQPPSVSRVCLEGTVSLPSGGYSIQGTILEVGGSPHQPTKPAST LILDPPSCQRGEK >gi568815580f:75185543_75388638|GENSCAN_predicted_CDS_6|582_bp atgggcttcgagcccaaaaagctgttgctgcagaaccacactctgaacacctccttacag gccttgggatctgccccagaacgcttcatgttagccaagcgctttagcattgtagagaag gcaggagaacagaaccatgggagagcagctgtaagcacaattaggtataactcagaagtg gccgccaagtatgtgccctcagtgctcattaattggacagctattgaagagctgcttata agcgccaatcgccatgtcctgccagcagttagattagttctttacaaacctaaatcagac cacatcggctgccagtccaaagccctcacaaaggccttccaggcccgtgtgactcggacc tttgccttttctcttgctcctccacctcccactctgtcttctcagcaatgtcagcctcct tctgtttctcgggtttgcctggagggcacagtgtccctcccctctggaggatacagcatt caaggcaccatcttggaagtggggggcagccctcatcagccaaccaaacctgccagcacc ttgatcttggacccccccagctgccagaggggcgagaaataa >gi568815580f:75185543_75388638|GENSCAN_predicted_peptide_7|231_aa MVTLIRGQLSGKSLATQQGDREEKVDRARQTRVHPECTLALHIIAGLALSIWRHQVLRKK KEEEEEEKGRERGGEGEKEEDEEVEGEKPGGRGEIKGKTDEDGLETPPGQGSGTDLFTFV YAGPRKCRSTTGTSHISPSDAGEGEGQAVSFQGYDPEISHFCFSHPGGQIQSQGHIELRL KRAGGGTEFMHFNLIAPHLRRALRWAIASLSSGVAFKAFPGEGSQTDQCGS >gi568815580f:75185543_75388638|GENSCAN_predicted_CDS_7|696_bp atggtaacgctaatccgtggccaactgtctggcaaaagcttagcaactcaacagggagac agggaggagaaggtggacagagcccggcagacccgtgtccaccccgagtgcaccctggct cttcacatcatcgcagggctggccttgagcatctggaggcatcaggtcctcagaaaaaag aaagaggaggaggaggaagagaagggaagggaaagagggggagaaggagagaaagaagaa gatgaagaagtagaaggggaaaaaccaggaggcagaggagaaataaaaggaaaaactgat gaggatggactagaaactccaccagggcaaggatctgggactgatttgttcacgtttgtg tatgcaggccctagaaagtgcaggtccaccacaggcacttcacacatcagcccatctgat gcgggggaaggggaaggacaagcagtttcttttcaaggatatgacccagaaatttcacat ttttgcttctcacatcctggtggccagatccaatcacagggccacatcgagctgcgcctg aaacgtgctgggggaggaacagaattcatgcatttcaacctgattgccccacatctacgc agggctttgagatgggccatcgctagtctgtcctctggcgtggctttcaaggccttccct ggagagggctctcagaccgaccaatgtggctcatga >gi568815580f:75185543_75388638|GENSCAN_predicted_peptide_8|174_aa MTIRYLPCSFQDSNISLQEKNLDNFLVIVCIDTSPGLAVFKEEFWREGIGMSSLKPPQNC CESPGTAITVFHQLGGFSSRNGLVRSPGDGAKSKIKVAAVVLVIAAEKYAVTVSASRTPQ EPLGASYEPEPALECAPDPKTPFPLAQEFRQRLSHISYEGHAVAIHHPKSLGPV >gi568815580f:75185543_75388638|GENSCAN_predicted_CDS_8|525_bp atgacaattcgctacctgccatgcagtttccaagacagcaacatctcattacaggaaaaa aatctggacaattttctggttattgtctgcattgacacttctcctggattggcagtcttt aaggaagaattttggagggagggcattggcatgtcctctttgaagcccccacagaactgt tgtgagtctcctggaactgccataacagtgttccaccaactgggtggcttcagcagcaga aatggattggttcgcagtcctggagacggggctaagtctaagatcaaggtggctgcagtg gtgctggtgattgctgctgagaaatacgcagtcactgtcagcgcttcccgaactccccag gagcccctgggagcgtcctatgagccagagccggcattggaatgtgctcctgatcccaag acacctttccccctcgcccaagaattccgccagcgactgagtcacatttcctatgagggc catgctgtagctattcatcatcctaaaagcctgggtcctgtgtga