GENSCAN 1.0    Date run: 5-Nov-116    Time: 11:50:00

Sequence gi568815580f:3153248_3377934 : 224687 bp : 40.96% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.16 Intr -   1841   1700  142  0  1  112   21   193 0.131  13.49
 1.15 Intr -   3739   3651   89  0  2  103  -19    70 0.034  -3.50
 1.14 Intr -   6986   6767  220  0  1   34   13   355 0.056  19.04
 1.13 Intr -  11192  11031  162  0  0   46  107   137 0.995  10.43
 1.12 Intr -  12882  12712  171  1  0   47   64   110 0.470   3.49
 1.11 Intr -  15734  15570  165  0  0   67  113   158 0.996  15.21
 1.10 Intr -  20753  20691   63  0  0  105   87    51 0.953   4.47
 1.09 Intr -  20961  20873   89  2  2   88   56   173 0.993  12.80
 1.08 Intr -  22887  22795   93  2  0  110   86    87 0.983   8.86
 1.07 Intr -  34390  34233  158  1  2   60   68    45 0.025  -2.41
 1.06 Intr -  35939  35501  439  1  1   77   61   516 0.319  40.39
 1.05 Intr -  40711  40571  141  0  0   55  113    56 0.155   3.35
 1.04 Intr -  62004  61687  318  2  0  102   71   284 0.009  22.25
 1.03 Intr -  75834  75779   56  0  2   53   92    54 0.307  -0.84
 1.02 Intr -  77648  77518  131  0  2   34  108    58 0.328   1.89
 1.01 Init -  81546  81393  154  0  1   77   31   118 0.257   5.19
 1.00 Prom -  84794  84755   40                              -4.65

 2.06 PlyA -  85375  85370    6                               1.05
 2.05 Term -  88211  88048  164  0  2   96   44   112 0.936   4.72
 2.04 Intr -  88512  88395  118  0  1   88   22    99 0.860   2.42
 2.03 Intr -  91011  90820  192  0  0   45   32   260 0.868  14.87
 2.02 Intr -  93093  92828  266  1  2  -29   91   215 0.957   6.41
 2.01 Init -  94810  94462  349  1  1   90   37   283 0.855  20.79
 2.00 Prom -  95974  95935   40                              -7.65

 3.00 Prom +  96135  96174   40                              -7.65
 3.01 Init + 100001 100181  181  1  1   95   30   209 0.974  15.19
 3.02 Intr + 100642 100803  162  2  0   61  119   193 0.995  18.73
 3.03 Term + 102499 102671  173  2  2  106   44   267 0.999  21.11
 3.04 PlyA + 102854 102859    6                               1.05

 4.05 PlyA - 103313 103308    6                               1.05
 4.04 Term - 106354 106239  116  2  2   30   48   163 0.038   4.25
 4.03 Intr - 109068 108806  263  2  2   39   45   211 0.662   8.11
 4.02 Intr - 110136 110112   25  1  1   90   84    53 0.738   1.27
 4.01 Init - 113159 113078   82  2  1   47   45    82 0.558   0.98
 4.00 Prom - 118591 118552   40                              -6.45

 5.00 Prom + 118778 118817   40                              -7.05
 5.01 Init + 119652 119835  184  2  1   95   42   248 0.999  20.23
 5.02 Intr + 124006 124167  162  2  0   69  109   178 0.984  17.03
 5.03 Term + 124518 124690  173  1  2  129   48   276 0.999  24.71
 5.04 PlyA + 124908 124913    6                               1.05

 6.09 PlyA - 125240 125235    6                               1.05
 6.08 Term - 130388 130195  194  0  2  118   36    48 0.338  -0.80
 6.07 Intr - 130943 130722  222  0  0   51   60   155 0.141   6.18
 6.06 Intr - 131422 131368   55  1  1   84   94    10 0.124  -1.17
 6.05 Intr - 138950 138771  180  2  0   18   80   110 0.003   2.34
 6.04 Intr - 153017 152862  156  2  0   10   67   161 0.042   5.49
 6.03 Intr - 166313 166219   95  0  2   92   95    70 0.420   6.86
 6.02 Intr - 178351 178192  160  1  1   44  -15   148 0.015  -1.26
 6.01 Init - 180987 180934   54  0  0   70   95    91 0.986   9.33
 6.00 Prom - 193282 193243   40                              -6.05

 7.05 PlyA - 194475 194470    6                               1.05
 7.04 Term - 199551 199295  257  1  2   70   38   169 0.748   4.86
 7.03 Intr - 203652 203559   94  0  1   32    7   152 0.387   0.12
 7.02 Intr - 206121 205865  257  1  2   35   97   169 0.059   8.74
 7.01 Init - 216073 215989   85  1  1   80   81   130 0.685  12.53

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -   6986   6763  224  0  2   34   49   373 0.875  24.10
S.002 Init -  25254  25160   95  0  2   62   40    80 0.874   0.50
S.003 Intr -  32702  32640   63  2  0   70  101    65 0.813   4.00
S.004 Sngl -  53477  53178  300  2  0   44   36   241 0.874  10.04
S.005 Term -  62004  61578  427  2  1  102   53   330 0.934  24.69
S.006 Term - 178351 178187  165  1  0   44   49   150 0.969   3.63
S.007 Init - 206170 205865  306  1  0   79   97   184 0.804  15.75

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815580f:3153248_3377934|GENSCAN_predicted_peptide_1|864_aa MKLQNMQGKREMVKAARKKGSVFNGTIIRLAACFPVATVQAEGGTAIASKCCVQVLGIMN KELDKMHKQSKDRMMQQKQRFIENESILHMVGAAQLLVLVNDWKFPGAQHLETRWPGSFK GHRMSLPFYQRCHQHYDLSYRNKDVRSTVSHYQREKKRSAVYTQGSTAYSSRSSAAHRRE SEAFRRASASSSQQQASQHALSSEVSRKAASAYDYGSSHGLTDSSLLLDDYSSKLSPKPK RAKHSLLSGEEKENLPSDYMVPIFSGRIQCLRCGTLNLVKLLNLITAIHNIYILPLLCFS QKHVSGITDTEEERIKEAAAYIAQRNLLASEEGITTSKQSTASKQTTASKQSTASKQSTA SKQSTASRQSTASRQSVVSKQATSALQQEETSEKKSRKVVIREKAERLSLRKTLEETETY HAKLNEDHLLHAPEFIIKPRSHTVWEKENVKLHCSIAGWPEPRVTWYKNQVPINVHANPG KYIIESRYGMHTLEINGCDFEDTAQYRASAMNVKGELSAYASVVVKRYKGEFDETRFHAG ASTMPLSFGVTPYGYASRFEIHFDDKFDVSFGREGETMSLGCRVVITPEIKHFQPEIQWY RNVHNHFIFDLVFYTWSPVGLWHMHMIGGNTGMAVQTNVGAGSWAQWAVGSRQCEHQAAG VPLSPSKWVQTLWSGERATLTFSHLNKEDEGLYTIRVRMGEYYEQYSAYVFVRAWVTERD PVSKKRKKKKRERRRKGKEGKGRERKEEEEEEEKKKKEEEKKEEEEEKKKKEEEEEKEEE EEKEKKNSALPLPSAAPLSAIQCSESLEGKKCLAGRDADAEIEGAPAAPLDVKCLEANKD YIIISWKQPAVDGGSPILGYFIDN >gi568815580f:3153248_3377934|GENSCAN_predicted_CDS_1|2592_bp atgaagctgcagaacatgcagggtaaaagggaaatggtaaaagctgccagaaaaaaaggg agtgtcttcaatggaacaataattaggctggcagcctgctttcctgtagcaaccgtgcaa gctgaaggtggcacagcaatagcttcaaaatgctgtgtccaagttcttggcatcatgaac aaagaattggacaaaatgcacaaacaaagcaaggacagaatgatgcaacaaaagcagaga tttattgaaaatgaaagtatactccatatggtgggagcggcccaacttctagtcctggtg aatgactggaaatttccaggtgctcaacacttagagaccaggtggcccggttccttcaag gggcacaggatgtctttgcctttttatcagaggtgccaccagcactatgatctcagctac 
cgcaacaaggacgtgcgcagcaccgtgagtcactaccagcgggagaagaaacgctccgcc gtctacacccagggctccacggcctacagcagccgctcctccgccgcgcaccgccgggag tccgaggccttccgtcgggcgtccgcctcctcctcccagcagcaggcctcgcagcacgcc ctgagctctgaagtcagtcggaaggcagcctcagcctacgattatggctcctcccatgga cttacagattccagtctgctgttagatgattattcatccaagttgagccccaaaccaaag agagccaagcacagcctactgtctggagaagagaaagaaaatttgcccagtgactacatg gtacccattttctcaggacggatacagtgtctgagatgtggtactttgaacttagtaaaa ttactcaatcttatcactgccatccacaacatctacattttgccattgttgtgttttagt caaaagcatgtcagtggaattactgatacggaagaagaaagaattaaagaagctgctgct tatatagcccagaggaatcttcttgctagtgaggaaggaatcacaacatctaaacagtcc acggcatccaagcagaccacggcatctaagcagtccacggcatccaagcagtccacagca tccaagcagtccacggcatccaggcagtccacggcatccaggcagtctgtggtttccaaa caggccacatccgctcttcaacaggaagaaacttctgaaaagaagtcaaggaaagttgtg attcgagaaaaggcagaacgcctgtccctgaggaaaacattagaagaaaccgagacatat catgccaagctgaatgaagaccatcttctccatgctcctgagtttatcattaaacctcgc tcccacacggtttgggagaaggagaatgtaaaattgcattgctccatagcaggctggcca gaacctcgtgtcacgtggtataaaaaccaggtgccaataaatgtccatgcaaaccctgga aagtatattattgagagtcgatatgggatgcacactctggagattaatggatgtgatttt gaagatacagctcagtaccgggcctcggcgatgaatgttaaaggagagctttcggcatat gcttcagttgtggtaaaaaggtataagggagagtttgatgagactcgcttccacgctggg gcttccaccatgcccctcagctttggtgtgaccccatatggttatgcatcccggtttgag atccactttgatgacaaatttgatgtgtcttttgggagagagggagagacaatgagtcta ggctgtcgtgttgtcatcactcctgaaattaaacatttccagccagagatccagtggtac agaaacgttcataatcactttatctttgatcttgtgttttatacatggagtccagtggga ctatggcacatgcacatgatcggaggaaatacgggaatggcagtgcagacaaatgtaggt gcaggctcctgggcacagtgggcagtgggtagtaggcagtgtgagcaccaagctgcagga gtacctctttctccatcaaaatgggtgcaaacactttggagtggagagcgggcaacgctg acattttcccatctcaacaaagaagatgaaggcctctatacaatccgtgtacggatggga gaatattatgaacaatatagtgcttatgtctttgttcgagcttgggtgacagaacgagac cctgtctcaaaaaaaagaaagaaaaaaaagagagagagaagaagaaaaggaaaggaaggg aagggaagggaaaggaaggaggaggaggaggaggaagagaagaagaagaaggaggaggag aagaaggaggaggaggaagagaagaagaagaaggaggaggaggaggagaaggaggaggaa 
gaggagaaggagaagaagaactctgctctgcctttgccatcagcagccccactctccgcc atccagtgctctgaaagcctggaaggaaagaagtgtctcgccgggcgcgatgctgatgca gagattgaaggagccccagctgctcccttggatgtgaagtgcttggaggccaacaaagat tatatcatcatctcctggaaacagccagctgtcgatggagggagtcctattctcggatat tttattgataan >gi568815580f:3153248_3377934|GENSCAN_predicted_peptide_2|362_aa MASDPRTGDLAGTPAQVRYRRQEPRRPPAAPAETGPRDGSHAPERERHSPGTATRIRAPL RAPRCRYHTTLLPREPLKTTASGLCLASRKEARSPFLARRVGLTSRRGDREGSRQCVGSY RRGRGLSERAGSDGTNPRRLRGCVCSWSLRALAMHGHLHRVVACHHESTSSASVLSRGHF RAERNGQFSLPLSLKAAFFCFSLSQWFSIVEEWWLGFALKEHLAMSGETVLLFTNGTDTA SSGEKPGTLRNILLCTGRPSTTERYSAQKADAECLLLSRHTVQVVGVSITLWSGGQWSSS HSSTEWGPSPTPCGSCQGLGLAPFEATARAVPWPLSAMAGAAGMQGTKSLGCLQHRDPGP GP >gi568815580f:3153248_3377934|GENSCAN_predicted_CDS_2|1089_bp atggcctcggatcccagaaccggggacctggctggcacacctgcccaggtgcgctaccgg cgccaggagccccgccgccctccagctgcccccgcggagacgggtccccgggatgggtct cacgcgcccgaacgcgagcgccactcacccggcacagctacgagaatccgagcacctctc cgagcccctcgctgccgctatcacaccaccctgctacccagagagccgctaaaaaccacc gcttccggcctctgcttagcaagccggaaggaagcccggtcacctttcttggcccgcaga gtgggtctgacttcacggaggggcgacagagagggaagtcgccagtgtgtgggctcttat cggcgagggcggggactcagcgagagggctggcagcgacgggacaaaccccagacggctg cgcggatgcgtttgttcctggagcttgcgagcacttgcaatgcatggacacctccacagg gtggtggcctgccaccacgagagcaccagctccgccagcgtcctctccagaggacacttt agagcagaaagaaacggacagttcagtcttcccctcagcttgaaggcagcattcttttgc ttctctttgagtcagtggttctcaatagtggaggagtggtggttgggttttgccctgaag gaacatttggcgatgtctggagagacagttttgcttttcacaaacgggacagatactgca tctagtggggagaagccagggacgctgcgaaacatcctactatgcacaggacggccctcg acaacagaacgttattcagcccaaaaagctgacgctgagtgtctgttgctttccaggcac acagtgcaagttgtcggtgtatccatcactctgtggtctggaggacagtggtcctcttct cacagctccactgagtggggccccagcccaacaccatgtggaagctgccaaggcttgggt cttgcaccctttgaggccacggcccgagctgtaccttggcccctttcagccatggctgga gcagctgggatgcagggcaccaagtccctaggctgcctgcagcatagggaccctggacct ggcccatga >gi568815580f:3153248_3377934|GENSCAN_predicted_peptide_3|171_aa MSSKRTKTKTKKRPQRATSNVFAMFDQSQIQEFKEAFNMIDQNRDGFIDKEDLHDMLASL 
GKNPTDEYLDAMMNEAPGPINFTMFLTMFGEKLNGTDPEDVIRNAFACFDEEATGTIQED YLRELLTTMGDRFTDEEVDELYREAPIDKKGNFNYIEFTRILKHGAKDKDD >gi568815580f:3153248_3377934|GENSCAN_predicted_CDS_3|516_bp atgtcgagcaaaagaacaaagaccaagaccaagaagcgccctcagcgtgcaacatccaat gtgtttgctatgtttgaccagtcacagattcaggagttcaaagaggccttcaacatgatt gatcagaacagagatggtttcatcgacaaggaagatttgcatgatatgcttgcttcattg gggaagaatccaactgatgagtatctagatgccatgatgaatgaggctccaggccccatc aatttcaccatgttcctcaccatgtttggtgagaagttaaatggcacagatcctgaagat gtcatcagaaatgcctttgcttgctttgatgaagaagcaactggcaccatacaggaagat tacttgagagagctgctgacaaccatgggggatcggtttacagatgaggaagtggatgag ctgtacagagaagcacctattgataaaaaggggaatttcaattacatcgagttcacacgc atcctgaaacatggagccaaagacaaagatgactga >gi568815580f:3153248_3377934|GENSCAN_predicted_peptide_4|161_aa MRVFGFVIIHLAAFALLCHGSLNRREPVIKRKAFTRRATGGGAGVRRRRPEGSREQSRAR LTRRQTPEPGPPCARPQTTAKSVRLWPDSGGADTSCLKGRSGRRTGRRTGRRTGRRGGTK WNGEKINCTREEDLIRHTQVWHIGPAQEIESELFQNPLVPN >gi568815580f:3153248_3377934|GENSCAN_predicted_CDS_4|486_bp atgagggtgtttggctttgtcataatacaccttgcagcttttgccctgctctgtcatgga tcactcaatcggcgggagccagttataaaacgcaaagccttcacaaggcgcgcgacggga gggggcgcgggggtacggagacggcggcccgaggggtctcgcgagcagagtcgagcgcga ctcacccgacgccaaacaccagaaccggggccgccctgcgcgagaccacaaacgacagcg aagagcgttaggctgtggccggacagtggcggcgccgacacttcctgcctgaaggggcgg agcggaaggcgcaccggaaggcgcaccggaaggcgcaccggaaggcgcggcgggaccaag tggaacggggaaaaaataaactgtacccgtgaagaggacctaattcgacacacacaagtc tggcatattggacctgctcaggaaatcgaaagtgaactatttcagaatccactagtacca aactga >gi568815580f:3153248_3377934|GENSCAN_predicted_peptide_5|172_aa MSSKKAKTKTTKKRPQRATSNVFAMFDQSQIQEFKEAFNMIDQNRDGFIDKEDLHDMLAS LGKNPTDAYLDAMMNEAPGPINFTMFLTMFGEKLNGTDPEDVIRNAFACFDEEATGTIQE DYLRELLTTMGDRFTDEEVDELYREAPIDKKGNFNYIEFTRILKHGAKDKDD >gi568815580f:3153248_3377934|GENSCAN_predicted_CDS_5|519_bp atgtcgagcaaaaaggcaaagaccaagaccaccaagaagcgccctcagcgtgcaacatcc aatgtgtttgccatgtttgaccagtcacagattcaggagttcaaagaggccttcaacatg attgatcagaacagagatggcttcatcgacaaggaagatttgcatgatatgcttgcttct 
ctagggaagaatcccactgatgcataccttgatgccatgatgaatgaggccccagggccc atcaatttcaccatgttcctgaccatgtttggtgagaagttaaatggcacagatcctgaa gatgtcatcagaaacgcctttgcttgctttgatgaagaagcaacaggcaccattcaggaa gattacctaagagagctgctgacaaccatgggggatcggtttacagatgaggaagtggat gagctgtacagagaagcacctattgacaaaaaggggaatttcaattacatcgagttcaca cgcatcctgaaacatggagccaaagacaaagatgactga >gi568815580f:3153248_3377934|GENSCAN_predicted_peptide_6|371_aa MSKELKGRLEAVKGGGEMVIVKIESDNHHESTLKSQACSVSIRNKGDSDWLCSTYSAPLA AHGSPVPLEMLKRDISLEQGGTTAELTSVTVLPRKFSDTTSFPGFSKCVGMPITHTRQQA DLGRGPEETEDKKQQSRGSDELRSEGEEPEPYLLRTSKGQGQRKDPKSSKRKKQITYNGA PIHLAADFSVETLQAKKEWCDIFKVLKKKTFYPKMVLTAGPSHPQSVISVSFTGLANAAS LTMSPDLHGQPGFAGLSSVAAVDWQWALLRPSPPQGRMGLLPICQATQGPQLCMASPSDK EGFLLLFDPCLFLLTSLVFSYPRGRGEEQRRTQGQFQNKRKSSGVGWGRQIPLPDKRGET TVAKKKPGLRG >gi568815580f:3153248_3377934|GENSCAN_predicted_CDS_6|1116_bp atgtcaaaggagctgaagggaagactcgaggcagtaaaaggaggaggagagatggtcatc gtgaaaattgaaagtgataatcaccatgaaagtactctgaaaagtcaagcatgctctgtg agtataaggaataagggagacagtgattggctctgcagcacctattcagctcctctggct gctcatggctctcccgtgcccttggaaatgctcaagcgtgacatctccctggagcaaggt gggaccactgctgagctcacatcagtcactgtacttcctaggaagttttctgatacaacg tcatttcctggtttctcaaaatgtgtgggaatgccaataacacacactcggcagcaggct gaccttgggcgagggccagaggagacagaagacaagaagcaacagagcagaggaagcgat gagctcaggtctgagggagaggaaccagagccttatctgctgcgtacttccaaaggtcaa ggacaaagaaaggatcctaaaagcagcaagagaaagaaacaaataacatacaatggagct ccaatacatctggcagcagacttctcagtggaaaccttacaggccaagaaggagtggtgt gacatatttaaagtgctaaagaaaaaaaccttttatcctaaaatggtcctcacggccggc ccatcccaccctcaatctgttatcagtgtctctttcacaggcctggccaatgctgccagc ctgacaatgagcccggatttgcacggccagcccggatttgcaggtctctccagtgtggcc gctgtggactggcagtgggctctcctccgcccctcccctccacagggcaggatgggcttg cttcccatctgccaggcaactcagggtccgcagctctgcatggcaagcccatcagataaa gagggattcctgctcttgtttgatccttgtcttttcctgttgacatccttggtgttttca tacccacggggaagaggagaagagcagcgcagaactcaagggcaattccaaaacaaaagg aaaagctctggagttgggtggggcagacagatcccactgcctgataaacggggtgagacc acagtggctaagaagaagcctggattaagaggctag 
>gi568815580f:3153248_3377934|GENSCAN_predicted_peptide_7|230_aa MVLASAFGEGSRELTIMAEGEGEAAVSHGKCCAAVPLGGNPTLRVDQLWRKQRPPLQQCT ASAPGGWVTSLMGAGTHAANTGFQLVWSSKRPMYKVLQAFFGVQRSDRSAMNGQSSVDRH FDCFHTLAIVNKNAMDTEVQMFLRGGSGRRRLWQQAEQGALYSSPGASSQIPKCLYWKAR TIAVVAGSLGTVEHKPGPTRKLGHALKNAKNTLLSPRAPEGRIQFTFASK >gi568815580f:3153248_3377934|GENSCAN_predicted_CDS_7|693_bp atggtgctggcctctgcttttggtgagggctccagggagcttacaatcatggcggaaggt gaaggggaagctgctgtatcacatggaaagtgctgtgctgccgtgcccctcggtggaaat cctacactgagagttgaccagctttggaggaagcagaggcctcccctgcagcagtgcaca gcatcagcaccaggtggctgggtgacatccctcatgggagctgggactcacgcagccaat acaggttttcagttggtttggtcctcgaagaggccaatgtataaggttttgcaggccttc tttggtgttcagagatctgatagatcagccatgaatgggcagtcatctgttgatcgacac tttgactgtttccataccttggcaattgtgaataagaatgcaatggacacggaagtgcag atgtttctacgaggtggcagtggccgcagaagactctggcagcaggcagagcagggagcc ctgtacagtagtcctggtgcctcgtcacagatcccaaagtgcctttactggaaggctagg actattgcagttgttgcagggtccctggggactgtcgaacataagcctggccccacaaga aaactgggccacgcactgaagaatgcaaagaacactcttctgtctcccagagctccagag ggaagaatccagttcacttttgcttccaaatga