GENSCAN 1.0 Date run: 3-Nov-116 Time: 01:30:40 Sequence gi568815576r:35507300_35717257 : 209958 bp : 48.17% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 12673 12767 95 0 2 79 109 11 0.170 2.15 1.02 Intr + 15665 15778 114 2 0 79 115 37 0.366 5.06 1.03 Term + 19733 19859 127 2 1 77 46 79 0.230 0.36 1.04 PlyA + 24165 24170 6 1.05 2.05 PlyA - 24715 24710 6 1.05 2.04 Term - 26758 26112 647 2 2 94 43 1937 0.999 184.09 2.03 Intr - 33372 33319 54 1 0 130 33 27 0.498 0.35 2.02 Intr - 33608 33521 88 2 1 79 -4 94 0.894 -1.16 2.01 Init - 33994 33821 174 1 0 63 75 158 0.676 9.35 2.00 Prom - 34511 34472 40 -6.96 3.00 Prom + 34567 34606 40 -15.08 3.01 Init + 34929 35069 141 2 0 93 40 73 0.620 3.22 3.02 Intr + 39502 39781 280 0 1 122 94 503 0.928 51.35 3.03 Term + 44204 44733 530 0 2 116 53 1142 0.951 107.92 3.04 PlyA + 46678 46683 6 1.05 4.00 Prom + 56678 56717 40 -0.66 4.01 Init + 81555 81645 91 2 1 92 113 63 0.816 9.85 4.02 Term + 86660 86736 77 0 2 46 44 108 0.537 0.20 4.03 PlyA + 88009 88014 6 1.05 5.10 PlyA - 89900 89895 6 1.05 5.09 Term - 100144 99998 147 1 0 108 47 275 0.998 23.30 5.08 Intr - 103807 103585 223 0 1 110 75 322 0.644 31.23 5.07 Intr - 109966 109864 103 2 1 100 98 115 0.402 12.93 5.06 Intr - 115234 115136 99 2 0 94 68 23 0.143 0.98 5.05 Intr - 120360 120277 84 1 0 60 28 118 0.074 2.79 5.04 Intr - 127782 127656 127 0 1 92 68 27 0.322 1.35 5.03 Intr - 136008 135903 106 2 1 110 63 16 0.013 1.52 5.02 Intr - 140958 140910 49 1 1 87 77 28 0.043 -0.66 5.01 Init - 144459 144327 133 0 1 78 47 69 0.359 2.10 5.00 Prom - 144583 144544 40 -0.26 6.02 PlyA - 144867 144862 6 -0.45 6.01 Sngl - 146032 145094 939 1 0 58 43 327 0.504 21.71 6.00 Prom - 147250 147211 40 -5.16 7.00 Prom + 148163 148202 40 -3.66 7.01 Init + 148490 148641 152 1 2 69 106 16 0.892 1.02 7.02 Intr + 151316 152150 835 2 1 76 59 749 0.961 62.20 7.03 Term + 184731 184901 171 2 0 86 34 107 0.182 2.93 7.04 PlyA + 184915 184920 6 1.05 8.00 Prom + 186467 186506 40 -4.76 8.01 Init + 187953 188013 61 2 1 71 85 32 0.875 2.55 8.02 Intr + 189148 189315 168 2 0 92 97 77 0.926 8.92 8.03 Intr + 190417 190590 174 2 0 84 93 76 0.865 7.61 8.04 Term + 190661 190872 212 0 2 -14 49 191 0.577 2.46 8.05 PlyA + 191357 191362 6 1.05 9.04 PlyA - 191507 191502 6 -1.75 9.03 Term - 192292 192218 75 1 0 56 48 85 0.631 -0.86 9.02 Intr - 194961 194836 126 0 0 44 84 159 0.991 11.98 9.01 Intr - 196677 196510 168 0 0 83 88 134 0.958 13.04 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 160618 160790 173 2 2 74 44 126 0.815 4.79 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576r:35507300_35717257|GENSCAN_predicted_peptide_1|111_aa MEVGKKKQPSSPDADTPMQRPSQLPERRVPSSFYSVLGPGNTKKMDTDTGGQGGTSWPSE ALGLREDTGSAPQGPCTDRCESMYQNLLSQNTQGFGDQGPRVMSPDLRLLL >gi568815576r:35507300_35717257|GENSCAN_predicted_CDS_1|336_bp atggaggtggggaaaaaaaaacagccttccagccccgacgccgacacccccatgcagcgc ccctctcagctcccagaaaggagagttccttccagcttttactctgttctaggccctgga aatacaaagaagatggacacagacactggtggacaagggggcacttcctggccttctgaa gctttgggtctgagggaggacacaggcagtgctccccaagggccttgcacagaccgctgt gaaagcatgtaccagaatctgttgtcacaaaatacgcagggttttggagaccagggacct cgtgtcatgtctccggacctacggcttcttctgtaa >gi568815576r:35507300_35717257|GENSCAN_predicted_peptide_2|320_aa MGGLRGLGGARAGGGLRLAAAGPALPRAGGLLLSQVPGGLGTSDPGVPAWGSRAGGRELS QRRRDGFFLPARRRQQLKEEQGALLCARRCPTVTRRAKRSLIPPKGLDCIPLSNVIITII TITITIILIITIAFITIIITVIITIIITIITIITLIIIPVITIIIITITIITVIIIPIIT IIIITIITIITVIIPIIAIVIITTITVIIFPIITIIIITITIITIITVIIIPIITIIIIT VIITIITIIPIITIIIIITIIITTLTTITTIIIITITTIITIIITIITTIITITVSVIII ATIIVVIIIISQAYGPFAPF >gi568815576r:35507300_35717257|GENSCAN_predicted_CDS_2|963_bp atgggtgggctccgagggctcggcggggcccgggcagggggcgggctccggctcgccgcc gccggccctgccctgccccgagccgggggcctcctgctgagccaggtgcccggaggcttg gggacgtccgatccgggggtccccgcctggggctctcgggctgggggcagagagctctcc cagcgccggagggacggcttctttctccccgcgcggcgacgccagcagctgaaggaggag cagggcgctctgctctgcgcgcgcagatgccccactgtcacccgcagggcgaagaggtcg ctgatccctccgaaaggcttggactgcatccctctttctaatgtcatcatcaccatcatt accatcactatcaccatcatcctcatcatcaccattgccttcatcaccataatcatcacc gtcatcatcaccatcatcatcaccatcatcaccatcatcactctcatcatcatccccgtc atcaccatcatcatcatcaccatcaccatcatcactgtgatcatcatccccatcatcacc atcatcatcatcaccatcatcacaatcatcactgtcatcatccccatcatcgctatcgtc atcatcaccaccatcactgtcatcatcttccccatcatcactatcatcatcatcaccatc accatcatcaccatcatcactgtcatcatcatccccatcatcaccatcatcatcatcact gtcatcatcaccatcatcaccatcattcccatcatcaccatcatcatcattatcaccatt atcatcaccacccttaccaccatcaccaccatcatcatcatcaccatcaccaccatcatc actatcattattaccatcatcaccaccatcatcaccatcaccgtcagcgtcatcatcata gccaccatcattgttgtcatcatcatcatctctcaagcctatggaccttttgctcctttt tag >gi568815576r:35507300_35717257|GENSCAN_predicted_peptide_3|316_aa MGSLQDLRLHSSPASPGCSRQPHDCQDKVPRRKEPSMCSGLLRVKSWPRAMMKTLSSGNC TLSVPAKNSYRMVVLGASRVGKSSIVSRFLNGRFEDQYTPTIEDFHRKVYNIRGDMYQLD ILDTSGNHPFPAMRRLSILTGDVFILVFSLDNRESFDEVKRLQKQILEVKSCLKNKTKEA AELPMVICGNKNDHGELCRQVPTTEAELLVSGDENCAYFEVSAKKNTNVDEMFYVLFSMA KLPHEMSPALHRKISVQYGDAFHPRPFCMRRVKEMDAYGMVSPFARRPSVNSDLKYIKAK VLREGQARERDKCTIQ >gi568815576r:35507300_35717257|GENSCAN_predicted_CDS_3|951_bp atggggtccctgcaagacctgcgcctgcattccagtccagcctcaccaggctgctctagg cagccccacgactgccaagataaagtgccacgaagaaaagaacccagcatgtgttcaggt ttgctgcgggtcaagagctggccccgagccatgatgaagactttgtccagcgggaactgc acgctcagtgtgcccgccaaaaactcataccgcatggtggtgctgggtgcctctcgggtg ggcaagagctccatcgtgtctcgcttcctcaatggccgctttgaggaccagtacacaccc accatcgaggacttccaccgtaaggtatacaacatccgcggcgacatgtaccagctcgac atcctggatacctctggcaaccaccccttccccgccatgcgcaggctgtccatcctcaca ggggatgtcttcatcctggtgttcagcctggataaccgggagtccttcgatgaggtcaag cgccttcagaagcagatcctggaggtcaagtcctgcctgaagaacaagaccaaggaggcg gcggagctgcccatggtcatctgtggcaacaagaacgaccacggcgagctgtgccgccag gtgcccaccaccgaggccgagctgctggtgtcgggcgacgagaactgcgcctacttcgag gtgtcggccaagaagaacaccaacgtggacgagatgttctacgtgctcttcagcatggcc aagctgccacacgagatgagccccgccctgcatcgcaagatctccgtgcagtacggtgac gccttccaccccaggcccttctgcatgcgccgcgtcaaggagatggacgcctatggcatg gtctcgcccttcgcccgccgccccagcgtcaacagtgacctcaagtacatcaaggccaag gtccttcgggaaggccaggcccgtgagagggacaagtgcaccatccagtga >gi568815576r:35507300_35717257|GENSCAN_predicted_peptide_4|55_aa MGKVRVNPEDKGGHSVGVQSQGAGEGCQKQASSLGKTNLSYVTTKGSRNSGIQTL >gi568815576r:35507300_35717257|GENSCAN_predicted_CDS_4|168_bp atgggcaaggtgagggtgaatcctgaagacaaaggtggccactcggtgggtgttcagagc cagggagcaggtgaaggttgtcagaaacaagcttcctcacttggaaaaacaaatctgagc tacgtgaccaccaagggctcccggaactctggcattcagaccctgtga >gi568815576r:35507300_35717257|GENSCAN_predicted_peptide_5|356_aa MEYYAAIKKDEFVSFVGTWMKLETIILSKLSQGQKTKHRMFSLLGWTYMGYENSSCGLEL WWAFSITERREFQVYYNSCPSSSWIGDARAFSPPDPTGLYPIYSQDTQTFRLWLNYNTDF PGSPACKWQAMELLSPQSPFRVDRGEHLPFLVKGARYTLVPAGQEGALAAWLEALRGQLG RRGAVVSMMDAEGLERSSPDCAMGLSDGEWQLVLNVWGKVEADIPGHGQEVLIRLFKGHP ETLEKFDKFKHLKSEDEMKASEDLKKHGATVLTALGGILKKKGHHEAEIKPLAQSHATKH KIPVKYLEFISECIIQVLQSKHPGDFGADAQGAMNKALELFRKDMASNYKELGFQG >gi568815576r:35507300_35717257|GENSCAN_predicted_CDS_5|1071_bp atggaatactatgcagccataaaaaaggatgagttcgtgtcctttgtagggacatggatg aagctggaaaccatcattctcagcaaactatcgcaaggacaaaaaaccaaacaccgcatg ttctcactcctaggatggacctatatggggtatgaaaatagctcctgtggcttagagcta tggtgggctttttccataacagaaagaagagaatttcaagtctattacaattcatgtcct tctagttcctggattggagatgcaagagcattttccccaccagatccaactgggctttac cccatctactcccaggatactcagaccttcagactctggctgaattataacactgacttt cctgggtctccagcttgcaaatggcaggctatggaacttctcagcccacaatcacccttc agggtagaccgaggagagcacctccccttcctggtgaagggagcccgatacacgctggtg ccggctggccaagaaggagccctggccgcttggctggaggctctgcgaggacagctgggg agaaggggagctgtggtcagtatgatggatgctgaggggctggagaggagcagccctgac tgcgccatggggctcagcgacggggaatggcagttggtgctgaacgtctgggggaaggtg gaggctgacatcccaggccatgggcaggaagtcctcatcaggctctttaagggtcaccca gagactctggagaagtttgacaagttcaagcacctgaagtcagaggacgagatgaaggcg tctgaggacttaaagaagcatggtgccaccgtgctcaccgccctgggtggcatccttaag aagaaggggcatcatgaggcagagattaagcccctggcacagtcgcatgccaccaagcac aagatccccgtgaagtacctggagttcatctcggaatgcatcatccaggttctgcagagc aagcatcccggggactttggtgctgatgcccagggggccatgaacaaggccctggagctg ttccggaaggacatggcctccaactacaaggagctgggcttccagggctag >gi568815576r:35507300_35717257|GENSCAN_predicted_peptide_6|312_aa MIVYLENPIVSAQNLLKLIGNFSKVSGYKINVQKSQAFLHTNNRQTESQIMSELPFTIAS KRIKYLGIQLTRDVKDFFKENYKPLLNETKEDTNKWKNIPCSWVGRINIMKMAILPKVIY RFNAIPIKLPMTFFTELEKTTLKFIWNQKRAHIAKSILSQKNKAGDITLPDFKLYYKATV TKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPDKNKKWGNDSLFNKWCWENWLAT RRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNIIQDIGMGKDFMSKTPKATA TKAKIDKGISLN >gi568815576r:35507300_35717257|GENSCAN_predicted_CDS_6|939_bp atgattgtatatctagaaaaccccattgtctcagcccaaaatctccttaagctgataggc aacttcagcaaagtctcgggatacaaaatcaatgtgcaaaaatcacaagcattcttacac accaataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttca aagagaataaaatacctaggaatccaacttacaagggatgtgaaggacttcttcaaggag aactacaaaccactgctcaatgaaacaaaagaggatacaaacaaatggaagaacattcca tgctcatgggtaggaagaatcaatatcatgaaaatggccatactgcccaaggtaatttat agattcaatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaaact actttaaagttcatatggaaccaaaaaagagcccacattgccaagtcaatcctaagccaa aagaacaaagctggagacatcacgctacctgacttcaaactatactacaaggctacagta accaaaacagcatggtactggtaccaaaacagagatatagaccaatggaacagaacagag ccctcagaaataatgccgcatatctacaactatctgatctttgacaaacctgacaaaaac aagaaatggggaaatgattccctgtttaataaatggtgctgggaaaactggctagccaca cgcagaaagctgaaactggatcccttccttacaccttatacaaaaattaattcaagatgg attaaagacttaaatgttagacctaaaaccataaaaaccctagaagaaaacctaggcaat atcattcaggacataggcatgggcaaggacttcatgtctaaaacaccaaaagcaacggca acaaaagccaaaattgacaaagggatctcattaaactaa >gi568815576r:35507300_35717257|GENSCAN_predicted_peptide_7|385_aa MGEIQGLNNSQRGSFEFQLQYISNNEQYIFREVARQWKRVLSLQQEQLVGKDEDDAPLCE DVELQDGDLSPEEKIFLREFPRLKEDLKGNIDKLRALADDIDKTHKKFTKANMVATSTAV ISGVMSLLGLALAPATGGGSLLLSTAGQGLATAAGVTSIVSGTLERSKNKEAQARAEDIL PTYDQEDREDEEEKADYVTAAGKIIYNLRNTLKYAKKNVRAFWKLRANPRLANATKRLLT TGQVSSRSRVQVQKAFAGTTLAMTKNARVLGGVMSAFSLGYDLATLSKEWKHLKEGARTK FAEELRAKALELERKLTELTQLYKSLQQKDIRNKMTEGMYTHCDIFNNVILYPLAIRKNI IERCTLSVISGVISSPDIRKSYINY >gi568815576r:35507300_35717257|GENSCAN_predicted_CDS_7|1158_bp atgggagagatccagggattaaataattctcaaagaggtagctttgaattccagcttcaa tacatctccaacaatgaacagtacatttttagagaagtagcaagacaatggaaaagggtt ttgagtctccagcaggagcaacttgtgggaaaggatgaggatgacgctcctctgtgtgaa gacgtggagctacaagacggagatctgtcccccgaagaaaaaatatttttgagagaattt cccagattgaaagaagatctgaaagggaacattgacaagctccgtgccctcgcagacgat attgacaaaacccacaagaaattcaccaaggctaacatggtggccacctctactgctgtc atctctggagtgatgagcctcctgggtttagcccttgccccagcaacaggaggaggaagc ctgctgctctccaccgctggtcaaggtttggcaacagcagctggggtcaccagcatcgtg agtggtacgttggaacgctccaaaaataaagaagcccaagcacgggcggaagacatactg cccacctacgaccaagaggacagggaggatgaggaagagaaggcagactatgtcacagct gctggaaagattatctataatcttagaaacaccttgaagtatgccaagaaaaacgtccgt gcattttggaaactcagagccaacccacgcttggccaatgctaccaagcgtcttctgacc actggccaagtctcctcccggagccgcgtgcaggtgcaaaaggcctttgcgggaacaaca ctggcgatgaccaaaaatgctcgcgtgctgggaggtgtgatgtccgccttctcccttggc tatgacttggccactctctcaaaggaatggaagcacctgaaggaaggagcaaggacaaag tttgcggaagagttgagagccaaggccttggagctggagaggaaactcacagaactcacc cagctctacaagagcttgcagcagaaagatattaggaacaagatgaccgaagggatgtac acccactgcgatattttcaataatgtcatcctctaccccctggctattaggaagaacatc atagagcggtgtacactttctgtgatatcgggagtaatatcctctccagatatcaggaaa agttatattaattattaa >gi568815576r:35507300_35717257|GENSCAN_predicted_peptide_8|204_aa MGVLRARGGEGLAVTPRIAGGGVRLPVILFLISRDGEHDISFHIAVGVHSPGDTDPNIQQ VEYDMTANIAMNVQPPDIRNYVTGDCTLLAILGVISSSPIMDIKKNITREVYTPCDMESN SMLSLWDIRNNITEGGCTPPAILGVISFSPPRDIRNNITVGVYTPCDIATSIIVSLPAYK EQYHKGVYTPCDIGDNIFLSPAGY >gi568815576r:35507300_35717257|GENSCAN_predicted_CDS_8|615_bp atgggggtcctaagagcaagggggggagaggggctggctgttactccccgcatcgcagga gggggtgtacgcctacctgtgatattgttcctaatatccagggacggagagcatgatatt agttttcatatcgcagtaggtgtacactcacccggtgacaccgatcctaatatccagcag gtagagtatgacatgactgccaacatagcaatgaatgtacagccacccgatattaggaac tatgtcacaggagactgtacacttcttgcgatattgggagtaatatcatcctctcccatc atggatattaagaagaatattacaagggaggtgtacaccccctgcgatatggagagtaat agtatgctctccctttgggatattaggaacaatatcacagaaggaggatgtacaccccct gcgatattgggagtaatatcattttctccccctcgggatattcggaacaatatcacagtg ggtgtgtacaccccctgcgatattgccactagtatcatcgtctccctgccagcatataag gaacagtatcacaagggggtgtacaccccctgcgatattggggataatatcttcctctcc cccgctggctattag >gi568815576r:35507300_35717257|GENSCAN_predicted_peptide_9|122_aa VEGNELLLMTVGQWTIGAQLLYEMAVPGTPVSRHSAVEGPVGAATQGSEDGEGESTWVYT APNPGVVFLISKWEEDDIADNIEGGVHLFCDMVPDIQGEQYHTGLYTFCDIGSDINLSAF GY >gi568815576r:35507300_35717257|GENSCAN_predicted_CDS_9|369_bp gttgaggggaacgagttgctcttaatgactgttgggcagtggaccataggagcccaactg ctctatgaaatggctgttcctgggacacctgtgagcagacacagtgccgtggaagggcca gttggtgcagccacccagggctcagaggatggtgaaggtgaatccacgtgggtgtacacc gcccccaaccccggggtagtgttcctaatatccaagtgggaagaggatgacattgctgac aatatcgaagggggtgtacacctcttctgtgatatggttcctgatatccagggggaacag tatcacacggggctgtacactttctgcgatattgggagtgatatcaacctctcggccttt ggatattaa