GENSCAN 1.0 Date run: 4-Nov-116 Time: 16:34:53 Sequence gi568815581r:48626793_48828593 : 201801 bp : 46.89% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4378 4449 72 2 0 131 72 4 0.782 2.48 1.02 Intr + 6634 6761 128 2 2 109 86 118 0.795 14.10 1.03 Term + 6987 7277 291 2 0 41 54 165 0.963 3.74 1.04 PlyA + 8480 8485 6 1.05 2.10 PlyA - 9741 9736 6 -0.45 2.09 Term - 9981 9839 143 1 2 87 44 121 0.999 5.69 2.08 Intr - 10201 10076 126 2 0 61 94 102 0.058 8.75 2.07 Intr - 14427 14325 103 0 1 79 61 48 0.006 0.95 2.06 Intr - 16275 16230 46 2 1 129 51 21 0.010 0.91 2.05 Intr - 19693 19560 134 1 2 40 89 59 0.064 0.64 2.04 Intr - 20027 19813 215 0 2 45 92 118 0.059 6.33 2.03 Intr - 45191 45006 186 0 0 25 103 190 0.806 13.86 2.02 Intr - 52634 52606 29 1 2 47 94 30 0.278 -2.74 2.01 Init - 56533 56439 95 1 2 105 94 40 0.496 6.16 2.00 Prom - 57535 57496 40 -3.46 3.03 PlyA - 57748 57743 6 1.05 3.02 Term - 64584 64324 261 0 0 62 44 190 0.829 7.53 3.01 Init - 70712 70491 222 2 0 44 92 123 0.937 6.86 3.00 Prom - 90510 90471 40 -4.46 4.05 PlyA - 90657 90652 6 1.05 4.04 Term - 97145 97054 92 0 2 100 35 87 0.786 2.48 4.03 Intr - 97879 97666 214 1 1 18 24 246 0.624 9.49 4.02 Intr - 100251 100022 230 1 2 91 67 455 0.844 41.29 4.01 Init - 101801 101201 601 2 1 83 82 470 0.732 41.35 4.00 Prom - 103567 103528 40 -8.86 5.00 Prom + 105851 105890 40 -10.15 5.01 Init + 106651 106664 14 0 2 110 67 32 0.905 3.18 5.02 Intr + 107264 107388 125 2 2 121 68 31 0.567 4.63 5.03 Intr + 111749 111875 127 0 1 83 81 37 0.243 2.24 5.04 Intr + 118787 118952 166 2 1 5 57 160 0.247 4.56 5.05 Intr + 121090 121260 171 0 0 66 8 189 0.830 8.84 5.06 Term + 124554 124676 123 2 0 120 39 59 0.821 2.58 5.07 PlyA + 126797 126802 6 1.05 6.20 PlyA - 130359 130354 6 1.05 6.19 Term - 133846 133745 102 1 0 117 38 32 0.626 -0.62 6.18 Intr - 134841 134755 87 0 0 106 100 31 0.580 6.27 6.17 Intr - 135856 135738 119 2 2 46 55 67 0.370 -0.62 6.16 Intr - 142462 142211 252 2 0 90 39 75 0.307 0.21 6.15 Intr - 143305 142936 370 1 1 96 94 145 0.777 10.58 6.14 Intr - 158409 158131 279 0 0 98 103 193 0.988 19.47 6.13 Intr - 159543 159372 172 2 1 86 97 129 0.968 13.55 6.12 Intr - 161207 161019 189 1 0 91 74 203 0.996 17.90 6.11 Intr - 163314 163139 176 0 2 99 73 207 0.909 19.04 6.10 Intr - 164811 164586 226 2 1 105 75 317 0.890 30.09 6.09 Intr - 169354 169269 86 1 2 121 94 7 0.957 3.22 6.08 Intr - 170412 170269 144 0 0 125 93 109 0.998 15.58 6.07 Intr - 172968 172812 157 2 1 55 75 69 0.979 2.31 6.06 Intr - 174593 174463 131 2 2 93 61 70 0.952 4.29 6.05 Intr - 174870 174733 138 0 0 66 68 50 0.636 1.46 6.04 Intr - 177139 177110 30 1 0 130 76 20 0.834 3.33 6.03 Intr - 178199 177943 257 0 2 103 18 165 0.668 8.16 6.02 Intr - 185475 185370 106 0 1 112 99 72 0.641 10.49 6.01 Init - 190576 190490 87 1 0 55 62 97 0.333 2.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 10166 10076 91 2 1 58 94 139 0.899 12.17 S.002 Init + 19901 20036 136 1 1 47 117 182 0.889 15.38 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:48626793_48828593|GENSCAN_predicted_peptide_1|163_aa XLFLGGGGQGTPGMEDSQALSDWADTRLLTGAHAHTRTHRKNAAVQAEGGGGLTWAAAVP AVRGRRKKDVRGRIKRKGKQIYERQKHPPRPLSLPAHFVRVERSSPSPRDLASAEPAARR ALLSEESALTQPAPGRDWKPPPTRKIHRAPRRPYDRLGPPGLG >gi568815581r:48626793_48828593|GENSCAN_predicted_CDS_1|492_bp ncacttttcctgggagggggtggccaagggacaccagggatggaagacagtcaggctctg agtgactgggcagacacccgcctgctcacaggtgctcacgctcacacccgcacccatcgc aaaaacgcagctgtgcaggccgagggcggcggcggacttacctgggcggcggcggtgccg gcagtgcgcggccgtcggaagaaagacgtccgggggcggattaaaaggaaggggaaacaa atttacgagcggcagaagcacccgcctcggcccctttctttgcccgcacactttgtccgc gtggagagaagcagccccagcccccgggacttggcctccgccgaaccggccgccaggcgc gctcttctctcggaggagtcggcgctgacccagcccgcgcccggcagagactggaagccg ccgcctacacggaaaatccacagagccccgcgcagaccctatgatcggctcggcccgccg ggcttgggatga >gi568815581r:48626793_48828593|GENSCAN_predicted_peptide_2|358_aa MEKRVYLRASGKTSLKRWQLGWTVKVQERLVRNYVSNRLKTVSSYFRKKPLVSEPRQQPV LPSKLPDPKREFQNRQSKRFLAEVPGLETLGFLSEEHPLVSVHGERRSVSPFAPPAPSAS LPQTQTLPREASQRRGQKAATQSRHIPRLGRRHSPQTPWTPAVYEDARGTAWFREAWASR GHRLNLDLQTEAASRVGPRGAPENALASLSAGTRPYLSNRSVHSLVQGPELLGFEANMEE VVIGCCGLTGLSLRVPSTCDLVPGVPRSPVSSFASWGRGLKMLALPSSRLLGTSKEIRAV EGSLASEMELGEGAKGDVPAASSRPSPERLQEHAPSPGAGRRAARESCSARTQRQGAA >gi568815581r:48626793_48828593|GENSCAN_predicted_CDS_2|1077_bp atggaaaaacgggtctacctgagggcttcagggaaaacttccttgaagagatggcagctc ggctggactgtgaaggttcaagagaggctggtcaggaattatgtgagcaacagactcaag acagtctcctcctacttccgtaaaaagcctttagtttcagaaccaaggcagcagccagtt ctccccagtaaattaccagacccaaagagagagtttcagaaccggcagtcgaagcgcttc ctggcagaggtcccgggcctggaaacactcggctttctgagcgaggagcaccctctagtg tctgtccacggggaacgcaggagcgtgagccccttcgcgcccccagcgccgtcggcgtcg ctgccccagacacagacactgcctcgagaggcctcacagaggcgggggcagaaggcggcg acccagagccgccacatcccccgccttgggcgccgtcacagtccccagacgccctggact cctgcagtctacgaagacgcgcgggggacggcgtggttccgagaggcgtgggcttcccgg ggccaccggcttaaccttgatctccagaccgaggcagcttcccgggtagggcctcggggc gcacccgaaaacgccttggcctccctgtccgctggcacccgaccctacctctccaacagg tctgttcactccttggtccagggaccggagctcctgggcttcgaggcaaatatggaggag gtagtgattgggtgttgtggcctgactggtctcagtctaagagtccccagcacctgtgac ctggttcctggtgttccaagaagtccagtgtcttcttttgcaagctgggggaggggttta aagatgctggcgctgccctctagccggctgctggggacatcaaaggagattcgggctgtg gaaggctccctagcgtcggagatggagttaggagagggcgccaaaggcgacgtgccggcc gccagctccaggccgagccccgagcgcctgcaggaacacgccccttcacccggcgcggga cgcagagctgcgagagaatcttgttcagcgcggactcaacgccagggcgccgcctag >gi568815581r:48626793_48828593|GENSCAN_predicted_peptide_3|160_aa MKPKLIELKGEIEKSATGAEDFNPPFLTIDTTARQKINKDIEEFNKIINQQDLMNIYRTL HPTTAKCSFFSTTQENGSVVEIQNLLGEKYIHGVQMRPGVACSVSQAQKDEFILEGNDIE LVSNSAALIQQATTVKNKDIRKFLNGIYVSEKGTVQQADE >gi568815581r:48626793_48828593|GENSCAN_predicted_CDS_3|483_bp atgaagccaaaactgatagaactgaaaggagagatagaaaaatcggcaactggggctgaa gacttcaatcctcctttcttaacaattgatacgacagctagacagaaaatcaacaaggat atagaagagttcaacaaaatcatcaaccaacaagatctaatgaatatttatagaacactt cacccaacaacagcaaaatgttcattcttttcgactactcaggagaatgggtctgttgtt gaaatccaaaatttattgggtgaaaaatacatccacggggttcagatgagaccaggtgtt gcttgttcagtatctcaagcccagaaagatgaattcatccttgaaggaaatgacattgag cttgtttcaaattcagcggctttgattcagcaagccacaacagttaaaaacaaggatatc aggaaatttttgaacggtatctacgtctctgaaaagggaactgttcagcaggctgatgaa taa >gi568815581r:48626793_48828593|GENSCAN_predicted_peptide_4|378_aa MEPGNYATLDGAKDIEGLLGAGGGRNLVAHSPLTSHPAAPTLMPAVNYAPLDLPGSAEPP KQCHPCPGVPQGTSPAPVPYGYFGGGYYSCRVSRSSLKPCAQAATLAAYPAETPTAGEEY PSRPTEFAFYPGYPGTYQPMASYLDVSVVQTLGAPGEPRHDSLLPVDSYQSWALAGGWNS QMCCQGEQNPPGPFWKAAFADSSGQHPPDACAFRRGRKKRIPYSKGQLRELEREYAANKF ITKDKRRKISAATSLSERQITIWFQNRRVKEKKVLAKENLRAGFHRPRSYRLPAAWGLVG TELRVGVQDPMSPAIRPGHRQMYWSSPRETQEEGKEVRYEPSRMEEEGEPNSRSESLSKL RNGITGSRLRGLQARNNF >gi568815581r:48626793_48828593|GENSCAN_predicted_CDS_4|1137_bp atggagcccggcaattatgccaccttggatggagccaaggatatcgaaggcttgctggga gcgggaggggggcggaatctggtcgcccactcccctctgaccagccacccagcggcgcct acgctgatgcctgctgtcaactatgcccccttggatctgccaggctcggcggagccgcca aagcaatgccacccatgccctggggtgccccaggggacgtccccagctcccgtgccttat ggttactttggaggcgggtactactcctgccgagtgtcccggagctcgctgaaaccctgt gcccaggcagccaccctggccgcgtaccccgcggagactcccacggccggggaagagtac cccagccgccccactgagtttgccttctatccgggatatccgggaacctaccagcctatg gccagttacctggacgtgtctgtggtgcagactctgggtgctcctggagaaccgcgacat gactccctgttgcctgtggacagttaccagtcttgggctctcgctggtggctggaacagc cagatgtgttgccagggagaacagaacccaccaggtcccttttggaaggcagcatttgca gactccagcgggcagcaccctcctgacgcctgcgcctttcgtcgcggccgcaagaaacgc attccgtacagcaaggggcagttgcgggagctggagcgggagtatgcggctaacaagttc atcaccaaggacaagaggcgcaagatctcggcagccaccagcctctcggagcgccagatt accatctggtttcagaaccgccgggtcaaagagaagaaggttctcgccaaggagaacctg cgggcaggctttcatcgtccgcggagctacaggcttccagcggcctggggcctcgtgggt actgagctgcgtgtgggggtccaggacccgatgtcgcctgccattcggccagggcatcgg cagatgtattggtccagcccccgagagacccaggaagaaggcaaggaggttcggtacgag ccatctcgaatggaagaagaaggcgagccaaactcccgcagcgaatctctcagcaagcta cgaaatgggatcacagggtcgcgtctacgcgggctccaggcccgaaataatttctaa >gi568815581r:48626793_48828593|GENSCAN_predicted_peptide_5|241_aa MAAVSALAQFFHCSSLKSGKGLRGYSSQRPEPALGRGRPGGRPSVAASGITCPGSAGQAL DEGKHRLTPTSPSSPALHCPRSVQVKRASAIRNSAMRNSAHKHPPAKRHSDGATAAASLH ADELIQALKEPVNQRWVNSFARPEASTQKQPQCRALTPAGQVAAETGSPKAGSTGSQGEK VRRSTLSERPGELGKPSNNKQLVFQSVGKELGKNLRVELVLHGGGSGGARSKQLQQDRPD F >gi568815581r:48626793_48828593|GENSCAN_predicted_CDS_5|726_bp atggcggccgtcagcgctctcgcccagtttttccattgttcttctctgaagtctggaaag ggcctgcggggctatagctctcagcggccggagcctgcgcttggcaggggacggcctgga ggacggcctagtgtggctgcctcgggaatcacctgccctgggagtgcaggccaagctttg gacgagggaaagcacagactgacgccgaccagcccatcctccccggccttgcactgcccg agatccgtgcaggtcaagcgggcaagtgccatccgaaacagtgccatgcgaaacagtgcc cacaagcacccacctgcaaaaagacacagtgacggagcaacagcagctgcgtccttacac gcagatgagctgatccaggctttgaaggagccagttaaccagagatgggttaattccttt gccagaccagaggcgtccacccagaagcagccccagtgtcgcgcattgactcccgccggc caagtcgccgccgaaacaggatctcctaaagcgggctccaccgggtcccagggcgaaaag gtccgaagatctacgctgtcggaaagacctggagaactcgggaagcccagcaacaacaaa cagcttgtcttccagagtgtggggaaagagctggggaagaatctaagggtggagctagtg ttgcacggtggtggcagtgggggtgcccgaagcaagcagctgcagcaagacagaccggat ttctaa >gi568815581r:48626793_48828593|GENSCAN_predicted_peptide_6|1035_aa MGWDCGLARWARVGLRERAAVQPLAPGCAAMSFAFPPFIPQGYKTAFGVGTNKIVTQDNR WELPGAWYFPRASSQAREMPQCPTLESQEGENSEEKGDSSKEDPKETVALAFVRENPGAQ NGLQNAQQQGKKKRKKKRLGLKAGEWGAMLMIGDQSIQLPAFLSSIVRRAAQQYGFREGG EDDDWTLYWTDYSVSLERVMEMKSYQKINHFPGMSEICRKDLLARNMSRMLKMFPKDFRF FPRTWCLPADWGDLQTYSRSRKNKTYICKPDSGCQGKGIFITRTVKEIKPGEDMICQLYI SKPFIIDGFKFDLRIYVLVTSCDPLRIFVYNEGLARFATTSYSRPCTDNLDDICMHLTNY SINKHSSNFSRDAHSGSKRKLSTFSAYLEDHSYNVEQIWRDIEDVIIKTLISAHPIIRHN YHTCFPNHTLNSACFEILGFDILLDHKLKPWLLEVNHSPSFSTDSRLDKEVKDGLLYDTL VLINLESCDKKKVLEEERQRGQFLQQCCSREMRIEEAKGFRAVQLKKTETYEKENCGGFR LIYPSLNSEKYEKFFQDNNSLFQNTVASRAREEYARQLIQELRLKREKKPFQMKKKVEMQ GESAGEQVRKKGMRGWQQKQQQKDKAATQASKQYIQPLTLVSYTPDLLLSVRGERKNETD SSLNQEAPTEEASSVFPKLTSAKPFSSLPDLRNINLSSSKLEPSKPNFSIKEAKSASAVN VFTGTVHLTSVETTPESTTQLSISPKSPPTLAVTASSEYSGPETDRVVSFKCKKQQTPPH LTQKKMLKSFLPTKSKSFWESPNTNWTLLKSDMNKPHLISELLTKLQLSGKLSFFPAHYN PKLGMNNLSQNPSLPGECHSRSDSSGEKRQLDVSSLLLQSPQSYNVTLRDLLVIATPAQL DPRPCRSHASAMRDPCMQDQEAYSHCLISGQKGYKATSGVVFQRRTHDFQESKALSPPTE LCQLVFVSKIKDEECDPFLLHSLFLSLIPGETFGAYEAEVEEGCAEHSASVRQKNAVAVP APKECAVEEGGSTDN >gi568815581r:48626793_48828593|GENSCAN_predicted_CDS_6|3108_bp atgggctgggactgcggcttggctcgctgggcaagggtagggctgcgggagcgagcggcg gtccagcccctggcgcccgggtgcgcggccatgtcttttgcctttcctccttttattcct caaggctacaagactgcttttggtgttggcaccaacaaaattgttacgcaagacaatagg tgggaactaccaggggcctggtatttccccagagcctcctcccaggccagggagatgcca cagtgcccgactttggaaagccaggaaggggaaaactccgaagagaagggggacagttcc aaagaagatccaaaagaaaccgtcgcgctggcttttgtgagagagaacccaggggcacaa aacggacttcagaatgcccagcagcaaggcaagaagaagaggaagaaaaagaggttagga ttgaaagctggggaatggggagccatgttgatgattggtgatcaatctatccagctgccg gcctttctttcttcaatagtgcgcagggctgcccaacagtacggctttagagagggaggg gaagacgatgactggactctctattggacagattactcagtgtcactggagcgggtgatg gaaatgaaaagttaccagaagatcaatcacttccccgggatgagtgaaatctgccggaag gacttgctggccaggaacatgagccgcatgttaaagatgttccctaaagatttccgcttt ttccctaggacctggtgtcttcctgctgactggggagatttgcagacctacagcaggtca agaaaaaataagacatacatttgtaagccggattcgggctgccaagggaaaggtatattc atcacccggacagtgaaagaaatcaaaccaggggaggatatgatctgtcagctgtatatt tcaaagccctttatcattgatgggtttaagtttgacctacggatttatgtactggtgaca tcctgtgaccctctcaggatttttgtgtacaatgaaggactggcccgctttgcgacgacc tcttactcccgcccttgcacagacaacctggatgatatctgcatgcacctgactaattat tccattaataagcacagttcaaatttcagtcgagatgcacactctggcagtaagaggaag ctctccaccttcagtgcatacttggaggaccacagctacaacgtggagcagatatggagg gatattgaggacgtcatcatcaagaccctcatctcggcccaccccatcatcaggcataac taccacacctgcttccccaaccacacactcaacagcgcctgctttgagatcctgggcttt gacattttgttggaccacaaactcaaaccctggctgctggaggtcaaccactctccaagc ttctccaccgactctcggttggataaagaggtgaaagatggtctgctgtatgacacctta gtcctgatcaacctggaaagctgtgacaagaagaaagtcttggaggaggagagacaacgg gggcagttcctgcagcagtgttgttctcgggagatgaggattgaggaagccaagggtttc cgggccgtgcagttaaagaaaactgaaacgtatgagaaggaaaactgtggagggttccga ctgatttatcccagtctgaattcggagaagtatgagaagtttttccaggacaacaactcc ctcttccagaatactgttgcttccagggctcgggaggagtatgcccggcaactgatccag gagctgagactaaaacgggagaaaaagcccttccaaatgaagaagaaggtagagatgcag ggggaatcggcaggcgagcaagtgagaaagaagggcatgaggggctggcaacagaaacaa cagcagaaagacaaggccgccacccaagcctccaaacagtacatccagccattgacatta gtatcctacacacctgacttgctcttgagtgtcagaggtgaaaggaaaaatgaaacagac agcagcctcaaccaggaggctcccacggaggaggccagctctgttttccccaagctgacg tctgcgaagcccttcagttctctacccgatctgaggaatatcaatctcagcagctcgaag ttggagcccagtaaacccaacttcagcatcaaggaggccaagtctgcctctgcagtgaac gtattcactggcactgtgcacttaacctccgtagaaaccaccccagaatccaccacccaa ctctcaatctccccaaagtctccgccaaccctggctgtgaccgccagctctgagtacagt ggcccagagacggacagggtggtatcctttaaatgcaagaagcagcagacccctccacac ttaacccagaagaaaatgttaaaatcttttctgcccacaaaatccaagagcttctgggag agtccgaacacaaactggactttgctaaagagtgacatgaacaagccacatttgatatcc gagctactcaccaagcttcaactgagtgggaagctctccttcttcccagctcactacaac cccaagctggggatgaataacctgtcacaaaacccctccctgcctggggagtgccactcc cgcagtgacagctctggcgagaagaggcagctggatgtgtcctccctcctcttgcagagt cctcagagctataatgttactctgagggacctgctggtgattgccactccagcccaactg gatccaaggccttgtagaagccacgcaagtgctatgagggacccatgtatgcaggatcaa gaagcatacagccattgcctgatctctggccaaaaaggatataaagcaacttcgggagta gtgtttcagagaaggactcatgatttccaggagtcaaaagccctcagtccaccaacagaa ctatgccagttggtcttcgtctccaagataaaagatgaggaatgtgatccatttttgctt cactcactcttcctgagtctgatccctggtgagacctttggggcatatgaggcagaggtg gaagaaggctgtgcagagcacagtgcttctgtgagacagaaaaatgctgtggcagttcct gctcccaaagagtgtgcagtggaggaaggcgggtccacagataactag