GENSCAN 1.0 Date run: 8-Nov-116 Time: 02:09:34

Sequence gi568815589f:76919655_77120950 : 201296 bp : 42.00% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   7451   7502   52  1  1  135   72    36 0.192   4.26
 1.02 Intr +  10964  11058   95  0  2   84   59    59 0.150   1.36
 1.03 Intr +  11108  11247  140  1  2  120    3    42 0.137  -2.76
 1.04 Intr +  14527  14657  131  1  2   73   80    97 0.158   6.82
 1.05 Term +  32986  33149  164  2  2   81   40   167 0.420   8.32
 1.06 PlyA +  33250  33255    6                               1.05

 2.00 Prom +  33834  33873   40                              -8.25
 2.01 Init +  34301  34382   82  1  1   60   98   104 0.282   9.78
 2.02 Intr +  58266  58483  218  1  2   59   58   112 0.141   2.60
 2.03 Term +  64638  64802  165  2  0   85   44   105 0.217   2.73
 2.04 PlyA +  65193  65198    6                               1.05

 3.00 Prom +  66980  67019   40                              -4.25
 3.01 Init +  69952  70045   94  0  1   66   66    81 0.774   4.29
 3.02 Term +  72416  72645  230  0  2   43   38   249 0.699  11.21
 3.03 PlyA +  72659  72664    6                               1.05

 4.03 PlyA -  73765  73760    6                               1.05
 4.02 Term -  77340  77131  210  0  0   26   44   192 0.611   4.91
 4.01 Init -  84018  83857  162  0  0   34   37   137 0.096   2.98
 4.00 Prom -  89847  89808   40                              -4.75

 5.07 PlyA -  90351  90346    6                               1.05
 5.06 Term -  95581  95408  174  1  0  117   39   110 0.681   5.78
 5.05 Intr -  95999  95819  181  1  1   43   40   205 0.625  10.15
 5.04 Intr -  97523  97432   92  2  2   29   69   107 0.674   0.77
 5.03 Intr -  97936  97667  270  1  0   87   85   187 0.452  15.12
 5.02 Intr -  98614  98305  310  0  1   69   72   177 0.703   9.59
 5.01 Init - 100474  99399 1076  1  2   73   86   684 0.703  58.58
 5.00 Prom - 100515 100476   40                             -18.08

 6.00 Prom + 100527 100566   40                             -18.01
 6.01 Sngl + 100649 101299  651  1  0   93   50  1056 0.994  96.22
 6.02 PlyA + 101485 101490    6                               1.05

 7.05 PlyA - 101841 101836    6                               1.05
 7.04 Term - 102526 102017  510  1  0  -23   46   223 0.072   0.39
 7.03 Intr - 103293 103198   96  0  0   68   51    85 0.070   2.09
 7.02 Intr - 104251 104175   77  2  2   64   94    56 0.153   2.12
 7.01 Init - 117834 117741   94  0  1   68   61    57 0.064   1.59
 7.00 Prom - 118108 118069   40                              -5.75

 8.00 Prom + 119846 119885   40                              -6.85
 8.01 Init + 120762 121009  248  2  2   75   81   264 0.191  21.51
 8.02 Intr + 131040 131158  119  0  2    1   63   127 0.011   0.69
 8.03 Term + 160874 160962   89  0  2   91   35   123 0.492   4.14
 8.04 PlyA + 161641 161646    6                               1.05

 9.04 PlyA - 163005 163000    6                               1.05
 9.03 Term - 188121 188005  117  0  0  112   32   100 0.949   4.36
 9.02 Intr - 197181 197044  138  0  0   11  100   163 0.982   9.54
 9.01 Init - 201184 201068  117  1  0   34   91   132 0.633   8.37

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl + 120762 121019  258  2  0   75   49   277 0.806  17.38
S.002 Init - 127789 127760   30  1  0  114  106    43 0.820   8.49

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589f:76919655_77120950|GENSCAN_predicted_peptide_1|193_aa
ESEVRAMDRTQATCTESITLQTSQWVMQVDECMCRNLNYLLLGTVDLCEVDSGVWGWRQR
KGLTRPKVLPELRGKGGHSIFKLEQDIFIDLQVALWFHKWKHICQQAQSEILLDANAAEN
ALPLSQESLQTSYCFCLAPETFLVAMMKGVLLASGSKDQDAPKLSTVHRTAPNNRELSGL
KFHIAKVEKPYIG

>gi568815589f:76919655_77120950|GENSCAN_predicted_CDS_1|582_bp
gaaagtgaggtgagggcaatggaccgcacacaggcaacatgcacagaatcgataaccctg
caaacatctcagtgggtgatgcaggtggatgaatgtatgtgcaggaacctcaactacctc
ctgctgggcactgtagacctatgtgaggtggattctggagtgtggggttggaggcagagg
aaaggtctcaccagacctaaggtcttaccagagctaagaggaaaaggaggacattccatt
ttcaagcttgaacaagacatctttatagatctccaagtggcactctggtttcataaatgg
aagcacatctgccagcaagctcagagtgagattctcttagatgcaaatgcagctgaaaat
gcattgcctctttcacaagagtccctgcaaacatcttattgtttctgcttggctccagaa
acatttttggttgccatgatgaaaggtgtgctgttagcatctggtagtaaagaccaggat
gctcctaaactttctacagtgcacagaacagcccccaacaacagagaattatctggcctc
aaatttcatattgccaaagttgagaaaccctatattggatga

>gi568815589f:76919655_77120950|GENSCAN_predicted_peptide_2|154_aa
MGEMKGDVDKIQGQGEGLALAARAGGEENQPPGLLIHLPGLDSRTGLQDVSLEELNGPKE
YIYRLRHLGPPQVSPTCTEIPVSMLVPHSEISMHSHKPLDLGHSSIGLLWFAGGPLQIRV
TSVPPWTYHQRRLRNSKDGSLLLPLGAPSQGAPT

>gi568815589f:76919655_77120950|GENSCAN_predicted_CDS_2|465_bp
atgggagaaatgaaaggtgatgtggataaaatccagggacaaggagaaggactggctttg
gctgcaagggcaggaggtgaagagaaccagccaccaggcttattaatacatcttccagga
ctggattccaggaccggactccaggacgtttctctggaagaactgaatggtcccaaagaa
tacatctataggctcaggcatttaggacccccccaggtttcacccacctgcacagagatt
ccagtcagcatgttagtgcctcactctgaaatatcaatgcacagccacaaaccactagac
ttaggccactcttccatagggctgctgtggtttgctgggggtccactccagatccgagtc
acctcagtccctccctggacgtatcaccagcgaaggctacgaaacagcaaagatggcagc
ctgctccttcctctgggggctccatcccagggggcaccaacctga

>gi568815589f:76919655_77120950|GENSCAN_predicted_peptide_3|107_aa
MGSMSLMQCTFADQSPVTAIAAIGKTACCNQPLPFTPPDPVAAARDFDSQDGGGDESSMG
TLVQKVRGGGTVSRDHWEVEDEDIKNNWEDNDKKEKKQRGSRSKTRD

>gi568815589f:76919655_77120950|GENSCAN_predicted_CDS_3|324_bp
atgggctccatgtcattgatgcaatgcacatttgctgaccagtcccctgttactgctata
gcagccattggcaagacagcctgctgcaaccaacctctccccttcacaccgccagaccca
gtggcagcggcgagggactttgactcccaggatgggggtggggatgagtcctcaatggga
accctggtgcagaaggtgcggggtggtggcactgtcagcagggaccactgggaagttgag
gatgaggacatcaaaaataactgggaagataatgacaaaaaagaaaaaaaacaaagagga
agcagaagtaaaaccagagattaa

>gi568815589f:76919655_77120950|GENSCAN_predicted_peptide_4|123_aa
MVLRYTVATVVLDTGEERTYCTLVKGTGCLEDCKKETGGKPPWAVSQLLFAACLDQSLLS
PEAHRTSDTHTVLGMATEQRGQQREQESTDELQSWASTLKGPLGHRPPQAPPRSRAQHVP
VVD

>gi568815589f:76919655_77120950|GENSCAN_predicted_CDS_4|372_bp
atggttctaagatacactgttgcaactgttgtattggatactggagaagagagaacttat
tgcacccttgtaaaaggcactggctgccttgaagattgtaaaaaagaaactggaggcaag
ccaccctgggcagtcagtcagctgctttttgctgcctgcctagaccagtctttgctgtct
cctgaggcccatcgtacatctgacacacatactgttttggggatggccactgagcagcgg
ggccagcagagggagcaggagtcaacagatgagctgcagagctgggccagcaccttgaag
gggcctctgggacacaggccaccgcaggcaccacctcgttccagagctcagcatgttccc
gtggtggactga

>gi568815589f:76919655_77120950|GENSCAN_predicted_peptide_5|700_aa
MVVVCGSVVMMVVVGVVVVMGVKVPSRTGARRALGASRVQVSMVGAQHLEALATPQEAAV
LEHVPAVGVQRPEATLARLVGPPRNLDEAVVEGEVVAQAVLPALCVLAVVGEALHDELVD
VAQRQHLLGRVLDCHGGQRDVRVGRFLVAVRALPRPRHRPRLLHRPPARTGPQNLSALRS
WGLAAPTNSHSAPPVPPLCPLALQALSASLSCLLRIAWCLRPWPFGPTQLSEAGNFWPRR
LPTGAHSRGRSAPGLPPPSLRLPPGGQRLSAGAGCRGEWQGAVFPPGLFAFSRSEGNPDQ
DGWEEGLGSRDSGPLNLSLRLLSWRPVTAGLPVPWTQAPGQALSQTALEFSFSWSEKSGH
SGSSSEYRVLKASSLQETPMIRGLEGKTHGEESMGCFHQMPNSSQFYCAPTACEAPGAGR
QETVSELHHRGIEGETNSNQEINRIGTKGLGEWRLHHTFLERLGGKAEFRNKYTLRLWDC
FIICKIGWLKLEIHGSFPVRRFEEKGAFGHYGSLGFPTPNLEESHRPSCLGGCPLTSNLG
DTRSGVFSFWLLARRRLRIDVTTEACKKARGKEQLLGPKPFSKPGLPTASADSCAQGEEE
HKLPIGLGRRTALEKRQRFRRTSQFPWVLPLRTDLADVRKVAWEILSSQNCAFEGSKELR
FEAVISTVEWRWGSGRSERGAQELRSWGRCRAAAPPVNPL

>gi568815589f:76919655_77120950|GENSCAN_predicted_CDS_5|2103_bp
atggtggtggtgtgcggcagcgtggtgatgatggtggtggtgggggtggtggtggtgatg
ggggtgaaggtgccctcccggaccggcgcccggcgcgctcttggtgcttcccgcgtgcaa
gtgagtatggtcggcgcgcagcaccttgaagcgcttgcgacgccgcaggaagctgccgtt
ctcgaacatgtccccgcagtcggggtgcagcgcccagaagctacccttgccaggctggtc
gggcctccgcggaatcttgatgaagcagtcgttgaaggagaggttgtggcgcaggctgtt
ctgccagcgctgtgtgtgctcgcggtagtaggggaagcgctccatgatgaacttgtagat
gtcgctcagcggcagcatcttctcggccgagtgctggattgccatggcggtcagcgagat
gtaagagtagggcggtttttggtcgctgtacgagctcttccccggccgcggcatcgcccc
cggctcctccaccgccctccggcccgcactggaccacagaatctctccgcactccgttcc
tggggcctggctgcccccactaattcccactcggcccctccagtcccgccactttgccct
ctcgccctccaggccctctctgcttctctctcctgccttctccgaattgcctggtgtctc
cgaccttggcctttcgggcctactcagctctcggaagctggcaacttctggccccgacgg
ttgccgactggcgcgcacagtcgcgggcggagtgcaccaggtttaccgccgccttctctg
cgtcttcccccgggtggccaaagactgagtgctggagctggctgcagaggagagtggcaa
ggggccgtgtttcctcccggcctctttgctttttctaggtccgagggaaacccagaccaa
gatggctgggaggaaggtctgggcagtagagactctggccctttaaatctgagtttgcgc
ctgctgtcctggaggccagtgacagcaggacttccagtgccctggacgcaggcccctggc
caggcacttagccagaccgcgctggagttctccttttcctggagtgaaaaatcaggacac
tctgggtcaagttcagagtatcgtgtcctcaaggcctcttctctgcaagaaaccccgatg
attagagggctggagggaaaaactcatggggaagaaagcatggggtgttttcaccaaatg
ccaaattcatctcagttttattgcgcgccaactgcatgtgaggcaccaggtgcagggaga
caggagacagtatccgagctccaccatcgcggtatagagggtgagacgaacagtaaccaa
gaaattaatagaataggcactaaagggcttggcgaatggaggctccaccacactttcctg
gagcgactgggaggaaaagcagagtttagaaacaagtatactctccgtctctgggactgt
ttcattatttgtaaaatagggtggctgaagttggaaattcacgggtcctttcccgttagg
cgatttgaggagaaaggagcatttggacactatggctccctcggttttcccacgcctaac
ctggaagaaagccatcggcccagctgcttggggggctgccccctaactagcaaccttggt
gacaccagatcgggtgttttcagcttctggcttttggccagaagacgtctgagaatagat
gttacgactgaagcttgcaagaaggcccggggaaaagagcagcttcttggtcctaagccc
ttttcaaagccaggccttcccacggcttctgcggacagctgtgcccagggcgaggaggaa
cacaagttaccgatagggctaggacgacgtacagcgctagaaaaaaggcaaaggtttagg
cgcacatctcaatttccatgggttctccctttacgcactgacttagctgacgtgcggaag
gttgcctgggaaatactcagtagtcagaactgcgccttcgaaggctccaaggagttgagg
tttgaagctgtgatctcgacggtggagtggcgctgggggtctgggcgctcggagcgggga
gctcaggagctgcgcagctggggacgctgcagggcagcagcgccacccgtcaacccctta
taa

>gi568815589f:76919655_77120950|GENSCAN_predicted_peptide_6|216_aa
MQEAAAVAAAAAAAAAAAVGSVGRLSQFPPYGLGSAAAAAAAAAASTSGFKHPFAIENII
GRDYKGVLQAGGLPLASVMHHLGYPVPGQLGNVVSSVWPHVGVMDSVAAAAAAAAAAGVP
VGPEYGAFGVPVKSLCHSASQSLPAMPVPIKPTPALPPVSALQPGLTVPAASQQPPAPST
VCSAAAASPVASLLEPTAPTSAESKGGSLHSVLVHS

>gi568815589f:76919655_77120950|GENSCAN_predicted_CDS_6|651_bp
atgcaggaggcggcggccgtggcggcggcggcggcggcggccgcggcagccgcggtgggc
agcgtgggacgcctgtctcagttcccaccctacgggctgggctcggccgccgccgctgcc
gccgcggccgcggcgtccacgtcaggcttcaagcacccctttgccattgagaacattatt
ggccgggactacaagggcgtgctgcaggctggagggctgcccttggcgtccgtcatgcac
cacctgggctaccccgtgcccggccagcttggcaacgtcgtcagctccgtgtggccgcac
gttggcgtcatggattcggtggccgccgccgcggccgccgcagccgcagccggagtccct
gtaggcccggagtatggggccttcggggtcccggtcaagtccctgtgccactcggcaagc
cagagcctgcctgccatgccggtgcccatcaagcccacgcctgcgctgccgcccgtgtcc
gcgctgcagccggggctcactgtccccgcggcttcgcagcagcctccggcgccatccacc
gtgtgctccgcggccgcggcctcgcccgttgcctctctgctggagcccacagcccctacc
tcggccgaaagcaagggcggctccttgcactcggtgctagtgcactcctag

>gi568815589f:76919655_77120950|GENSCAN_predicted_peptide_7|258_aa
MEWKDMEDTNRVGQERLYLTLEESALRRPTKGRFGPHLSPESRTEGTPKKHTGPFRQDRR
LKGAPSGTHPADTEFLALDGSRGPQNGRERGEGQERQGKFLSHSRIETEELIRLKDISAQ
TAHGIQKQPQRAKVTGWSPCFRPRRLEGNSHGTFLQALRAILSSDVPSFSCIRKLGKQNP
AIQVICFLKIPNRKDRGNSHYSWALIRLIPKQVAGGKSPVLSPFGCVSQLLEFGALPFKL
LPFSFVQTNCSRGSSYPS

>gi568815589f:76919655_77120950|GENSCAN_predicted_CDS_7|777_bp
atggaatggaaggacatggaagatacaaatagagtcggacaagaaagactttacctaact
ctggaggaatcagctttgaggagacctacaaagggtcgctttgggcctcacctctctcct
gaaagcaggaccgagggcacccctaaaaagcacactgggcctttcaggcaggaccgcaga
cttaagggggcgccgtcgggcacccaccccgctgacaccgagttcctggccttagatggt
agtcgaggacctcaaaatgggagagagaggggggaagggcaagagcgacaaggcaagttt
ctttcccattcacgaattgaaactgaagaattaattcgcctgaaagacataagcgctcaa
acagcacacggaatacaaaaacagccgcagcgtgcaaaggtgactgggtggtccccttgc
ttccgtccgaggagactggaagggaacagccatggaacgtttctgcaagcgttaagagca
atattgtcgagtgatgtgccgagtttttcctgcattcgtaagttagggaaacaaaatccc
gctatacaagtgatttgtttccttaaaattccaaatagaaaggatcgtgggaattcacat
tattcttgggctctcatccgactgatacccaaacaagttgcaggtggaaagtcgcctgta
ttgtctccctttggttgtgtttcccaacttttggaatttggggctttaccctttaagctg
ctgccattctcttttgtacaaactaattgctctagaggttcctcctatccttcttga

>gi568815589f:76919655_77120950|GENSCAN_predicted_peptide_8|151_aa
MTSVIPVKDKKHLEVKLGELPSWILMQDRVIAAGIQRGYYWYYNKYINVKKGSISGLTMV
LAGYMLFRYCLSYKELKHKRLCSEENPLVGKSNVKDTNQEAIGVNQLMSLDEVGVEKDKM
KKGISTAAAGQEECTIMEDLTLGIGKTKFAL

>gi568815589f:76919655_77120950|GENSCAN_predicted_CDS_8|456_bp
atgacatcagtcataccagtgaaggacaagaaacatctggaggtcaaactaggggagctg
ccaagctggatcttgatgcaggatcgagtcattgccgcagggattcaaagaggttactac
tggtactacaacaaatacatcaacgtgaagaaggggagcatctcggggctgaccatggtg
ctggcaggctacatgctcttccgatactgcctttcctacaaggagctcaagcacaagcgg
ctatgcagtgaagagaatccattagtagggaagagtaatgtcaaagataccaatcaagag
gccattggagtaaatcaactgatgagtttggatgaagtaggtgtagagaaagataagatg
aagaaaggaatttctacagcagcagccggacaggaagaatgcaccatcatggaagatcta
actctggggattgggaagaccaaatttgctctctaa

>gi568815589f:76919655_77120950|GENSCAN_predicted_peptide_9|123_aa
MKGGFFKGRSSLVPRATGSFSISGVDEEHSTCNIIEAFVTWIAYNDEVIDAKHLLNDELV
YQNKPVDTEQTGIFRAPQVQILLLPHFFFPEPLWNEDRMTYYRTRVYREFLYDQLQDRKA
KED

>gi568815589f:76919655_77120950|GENSCAN_predicted_CDS_9|372_bp
atgaagggcggcttctttaagggaagaagctctctggtccccagagccactggatccttc
tcaatctctggtgttgatgaggaacactccacatgcaacatcatcgaggcgtttgtgaca
tggatagcctataacgatgaagttatagatgcaaagcacctgcttaatgatgagctggtg
taccagaacaagcctgtggatacagaacagacgggcatttttagagcaccacaggtccaa
attctgctgttgccacatttcttctttccagaacccctatggaatgaagatcgtatgacc
tactatcggacaagagtatatcgagaatttctgtatgaccagctccaagacagaaaggca
aaggaagattaa