GENSCAN 1.0 Date run: 3-Nov-116 Time: 01:39:00 Sequence gi568815589f:126732510_126933910 : 201401 bp : 45.42% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 7086 7203 118 2 1 70 84 52 0.340 2.58 1.02 Term + 10918 10982 65 2 2 67 54 98 0.865 2.35 1.03 PlyA + 13328 13333 6 1.05 2.07 PlyA - 14109 14104 6 1.05 2.06 Term - 23041 22791 251 2 2 63 44 136 0.439 2.57 2.05 Intr - 26243 26086 158 1 2 85 110 0 0.057 1.55 2.04 Intr - 45491 45398 94 0 1 72 99 61 0.376 4.62 2.03 Intr - 47671 47570 102 2 0 93 77 47 0.593 4.25 2.02 Intr - 72563 72463 101 1 2 56 57 108 0.022 4.25 2.01 Init - 93078 92948 131 0 2 53 37 178 0.322 8.82 2.00 Prom - 94277 94238 40 -3.86 3.00 Prom + 97447 97486 40 -5.76 3.01 Sngl + 100001 101404 1404 1 0 65 42 1427 0.939 131.81 3.02 PlyA + 101439 101444 6 1.05 4.00 Prom + 112935 112974 40 -6.16 4.01 Init + 113876 114007 132 1 0 78 100 13 0.111 1.64 4.02 Intr + 122891 123001 111 1 0 54 96 79 0.305 5.88 4.03 Intr + 127881 128230 350 2 2 78 94 83 0.175 2.05 4.04 Intr + 135843 135894 52 0 1 78 62 32 0.107 -1.49 4.05 Intr + 144654 144721 68 2 2 102 100 12 0.060 1.50 4.06 Term + 146881 148405 1525 1 1 103 44 1505 0.207 137.58 4.07 PlyA + 148867 148872 6 1.05 5.03 PlyA - 148921 148916 6 1.05 5.02 Term - 150160 150080 81 1 0 123 44 31 0.143 -0.11 5.01 Init - 154606 154532 75 1 0 65 32 129 0.283 6.09 5.00 Prom - 160847 160808 40 -4.36 6.00 Prom + 165633 165672 40 -1.86 6.01 Init + 182046 182236 191 2 2 57 15 259 0.716 11.98 6.02 Intr + 182343 182503 161 0 2 60 81 119 0.710 8.03 6.03 Term + 184803 184918 116 1 2 43 54 69 0.401 -2.27 6.04 PlyA + 185133 185138 6 1.05 7.03 PlyA - 185628 185623 6 1.05 7.02 Term - 190619 190428 192 2 0 71 43 95 0.179 0.72 7.01 Intr - 195850 195744 107 2 2 75 56 68 0.168 2.23 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:126732510_126933910|GENSCAN_predicted_peptide_1|60_aa MGPPLEPLRNGAILGAQCWSLLLAGQAHSSDCGQISFVKFKHLVGDMELLLVNKKSRLWN >gi568815589f:126732510_126933910|GENSCAN_predicted_CDS_1|183_bp atgggccccccactggagcctctgaggaatggggccatcttgggggctcagtgttggtct ctgctgctggcaggtcaggcacacagcagtgactgtggccagatcagctttgttaagttt aagcatcttgtaggggacatggagctgctcctggtgaacaagaagagccggttgtggaac tga >gi568815589f:126732510_126933910|GENSCAN_predicted_peptide_2|278_aa MHNHEEKTLYGPSEKEAVCKLRKKATEEPKPANILITDCPLPELPVVASAAATDSACAFS PRKRFPVPPPETDSASGDNQEDCKRGLRTGAQTKSTTFLFFVARFDFLKRGAQPGHPGTL WGSENTFLWDGLEVVLLQPVQDRMEERQMTLAASYPEATWNTYISLSKLDLMAIPVAKEP GRKITAFWPIVDGGKELGEGFDSTQRPFPQRPRAPQSRPCSSKAPAGQHPPELRCEVRNG PPKLLLPLLLPDIPHVQCVIFYDRSYIRHNQIYGSCIL >gi568815589f:126732510_126933910|GENSCAN_predicted_CDS_2|837_bp atgcacaaccacgaagaaaagactctgtatgggcccagtgagaaggaggctgtctgcaag ctgaggaagaaggccacagaagaacccaaacctgctaacatcttgatcacagactgccca cttccagaactgcctgttgtagcctcagctgccgcaacagattcagcctgtgccttcagc ccccggaaacgcttcccggtcccgccccccgaaactgacagtgcttccggggacaaccaa gaagattgcaaacgggggctcaggacaggagcccaaaccaaatcaaccacattcctgttt tttgttgcacgtttcgacttcctgaagagaggtgcccaaccaggtcatccaggcacactg tggggttctgaaaatacatttctctgggatggcctggaggtggtgctgctgcagccggtc caggacaggatggaggaaaggcagatgacactagctgcttcttacccagaagccacctgg aatacttacatttcattgtcaaaactggatctcatggctatcccagttgcaaaagagcct gggagaaaaataacagctttctggcctatagtggatggtggcaaggagcttggggaaggt tttgactcgacacagcggcccttccctcagaggcccagggccccacagagcaggccctgc tcctccaaagcacctgccggccagcacccaccggaattgcggtgcgaggttcggaacggg ccccccaagctgctcctgcccctgctgcttcccgacatcccccatgttcaatgcgtgatt ttttacgatcgatcttatatccggcacaatcagatttatggctcctgcatcttatga >gi568815589f:126732510_126933910|GENSCAN_predicted_peptide_3|467_aa MEPGTNSFRVEFPDFSSTILQKLNQQRQQGQLCDVSIVVQGHIFRAHKAVLAASSPYFCD QVLLKNSRRIVLPDVMNPRVFENILLSSYTGRLVMPAPEIVSYLTAASFLQMWHVVDKCT EVLEGNPTVLCQKLNHGSDHQSPSSSSYNGLVESFELGSGGHTDFPKAQELRDGENEEES TKDELSSQLTEHEYLPSNSSTEHDRLSTEMASQDGEEGASDSAEFHYTRPMYSKPSIMAH KRWIHVKPERLEQACEGMDVHATYDEHQVTESINTVQTEHTVQPSGVEEDFHIGEKKVEA EFDEQADESNYDEQVDFYGSSMEEFSGERSDGNLIGHRQEAALAAGYSENIEMVTGIKEE ASHLGFSATDKLYPCQCGKSFTHKSQRDRHMSMHLGLRPYGCGVCGKKFKMKHHLVGHMK IHTGIKPYECNICAKRFMWRDSFHRHVTSCTKSYEAAKAEQNTTEAN >gi568815589f:126732510_126933910|GENSCAN_predicted_CDS_3|1404_bp atggagcctggaacaaactcttttcgggtagaatttcctgatttttccagcaccattcta cagaaactgaaccagcagcgccagcaaggacaattatgtgacgtctccattgttgtccaa ggccacattttccgggcacacaaagccgttcttgctgccagttcaccctacttttgtgac caggtactcctgaaaaacagcaggagaattgttttgcctgatgtgatgaacccaagagtg tttgagaacattctcctatctagttatacaggacgtctagtaatgcccgctccagaaatt gttagttacttgacagcggcaagcttcctccagatgtggcatgtggtagacaaatgcact gaagttttagagggaaaccctacagtcctttgtcagaagctaaatcatggcagtgaccac cagtcaccaagcagcagtagttataatggcctggtagagagctttgagctgggctctggg ggtcatactgattttcccaaagcccaagaactgagagatggtgaaaatgaagaggagagc accaaagacgagctgtcatcccagctcaccgagcacgaatacctgcccagcaactcgtcc acagagcatgaccgcctgagcacggaaatggcaagccaggatggggaggagggcgccagc gacagcgccgagttccactacacccggcccatgtacagcaagcccagcatcatggctcac aaacgctggatccacgtgaagcccgagcgcttagaacaggcttgcgagggcatggatgtg cacgcgacctacgacgagcaccaggtcacagagtccatcaacaccgtgcagacagagcac acggtgcagccttcgggagtggaggaggacttccacatcggggagaagaaagtggaagct gagtttgatgaacaggctgatgaaagcaattatgatgagcaggtggatttctatggctct tccatggaagagttttccggagagaggtcagatgggaatctaattgggcacagacaggag gctgccctcgcagcaggttacagtgagaatattgaaatggtaacagggattaaagaagaa gcttcccacttaggattctcagccactgacaagctgtatccttgtcagtgtgggaaaagt ttcactcacaagagtcagagagatcggcacatgagcatgcacctcggtcttcggccttac ggctgtggggtctgcggtaagaaattcaaaatgaagcaccatctcgtgggccacatgaaa attcacacaggcataaagccgtatgagtgtaatatctgtgcaaagaggtttatgtggagg gacagtttccaccggcatgtgacttcttgtactaagtcctacgaagctgcaaaggctgag cagaatacaactgaggctaactaa >gi568815589f:126732510_126933910|GENSCAN_predicted_peptide_4|745_aa MAGILTLGNVDWSEEEAIHCQHQSMKKGTEVGKCGGSCGNTVGKYCECGYETLQLMALMK IERQLHCVEKVGPVEGTLFDQQGAPRRARAEGRADAASRSRDSGVQTKAAASPRRGSGTT TGRDSGPRALARGPPRNATTRVGACPEARASRGRGREALPGAGGAGWGGSRSGGDWPGAG GGGGGAAGGRCERGALDSASMPAYFQSPRFGFGLQLFVVVVVFRNSQHFILKNLKQKRVR FMSVEMDSSSFIQFDVPEYSSTVLSQLNELRLQGKLCDIIVHIQGQPFRAHKAVLAASSP YFRDHSALSTMSGLSISVIKNPNVFEQLLSFCYTGRMSLQLKDVVSFLTAASFLQMQCVI DKCTQILESIHSKISVGDVDSVTVGAEENPESRNGVKDSSFFANPVEISPPYCSQGRQPT ASSDLRMETTPSKALRSRLQEEGHSDRGSSGSVSEYEIQIEGDHEQGDLLVRESQITEVK VKMEKSDRPSCSDSSSLGDDGYHTEMVDGEQVVAVNVGSYGSVLQHAYSYSQAASQPTNV SEAFGSLSNSSPSRSMLSCFRGGRARQKRALSVHLHSDLQGLVQGSDSEAMMNNPGYESS PRERSARGHWYPYNERLICIYCGKSFNQKGSLDRHMRLHMGITPFVCKFCGKKYTRKDQL EYHIRGHTDDKPFRCEICGKCFPFQGTLNQHLRKNHPGVAEVRSRIESPERTDVYVEQKL ENDASASEMGLDSRMEIHTVSDAPD >gi568815589f:126732510_126933910|GENSCAN_predicted_CDS_4|2238_bp atggctggaattctgaccttgggaaatgtggattggagtgaagaagaggccatccactgc caacaccaaagtatgaagaaaggtacggaggtgggaaagtgtggaggtagctgtggaaat acagtaggaaagtactgtgagtgtggttatgaaactttgcaattgatggctctgatgaaa attgagaggcaattgcactgtgttgaaaaagtgggtcctgttgaaggcacgctttttgac cagcagggggcgccgaggagagcgcgggctgaaggcagagccgacgccgcttcccggagt cgggactccggggtccagaccaaggctgccgctagcccgcggcgaggctctgggacgaca accgggcgggactcaggaccccgggccctggctagaggtccgccgcgcaacgccaccacc cgggttggcgcctgcccggaagcgcgggcgtccagggggcggggccgggaagcgcttccg ggggcggggggcgcgggctggggcggcagccggagcggcggggactggcctggcgccggc ggcggcggagggggcgccgcgggcgggcgatgtgagcgcggcgctctggacagcgcatcc atgcctgcttatttccagagcccccgctttggctttggacttcagctttttgttgttgtt gttgtttttcgaaactcccaacatttcattttgaaaaacctcaaacagaaaagagtacgc ttcatgtcagtagaaatggacagcagcagttttattcagtttgatgtgcccgagtacagc agcaccgttctgagccagctaaacgaactccgcctgcaggggaaactatgtgacatcatt gtacacattcagggtcagccattccgagcccacaaagcagtccttgctgccagctcccca tatttccgggaccattcagcgttaagtaccatgagtggcttgtcaatatcagtgattaaa aatcccaatgtgtttgagcagttgctttctttttgttacactggaagaatgtccttgcag ctgaaggatgttgtcagttttctgactgcagccagctttcttcagatgcagtgtgtcatt gacaagtgcacgcagatcctagagagcatccattccaaaatcagcgttggagatgttgac tctgttaccgtcggtgctgaagagaatcccgagagtcgaaacggagtgaaagacagcagc ttctttgccaacccagtggagatctctcctccatattgctctcagggacggcagcccacc gcaagcagtgacctccggatggagacgacccccagcaaagctttgcgcagccgcttacag gaggaggggcactcagaccgcgggagcagtgggagcgtttctgaatatgagattcagata gagggagaccatgagcaaggagacctattggtgagggagagccagatcaccgaggtgaaa gtgaagatggagaagtccgaccggcccagctgttccgacagctcctccctgggtgacgat gggtaccacaccgagatggttgatggggaacaagttgtggcagtgaatgtgggctcctat ggttctgtgctccagcacgcatactcctattcccaagcagcctcacagccaaccaatgta tcagaagcttttggaagtttgagtaattccagcccatccaggtccatgctgagctgtttc cgaggagggcgtgcccgccagaagcgggctttgtctgtccacctgcacagtgacctgcag ggcctggtgcagggctctgacagtgaagccatgatgaacaaccccgggtatgagagcagt ccccgggagaggagtgcgagagggcattggtacccgtacaatgagaggttgatctgtatt tactgtggaaagtccttcaaccagaaaggaagccttgataggcacatgcgactccatatg ggaatcaccccctttgtgtgcaagttctgtgggaagaagtacacacggaaggaccaactg gagtaccacatccggggccatacagatgataaaccattccgctgtgagatctgcgggaag tgctttccattccaaggtaccctcaaccagcacttgcggaaaaaccacccaggcgttgct gaagtcaggagtcgcattgagtcccccgagagaacagatgtgtacgtggaacagaaacta gaaaatgacgcatcggcctcagagatgggcctagattcccggatggaaattcacacagtg tctgatgctcccgattaa >gi568815589f:126732510_126933910|GENSCAN_predicted_peptide_5|51_aa MSQRLPMAFASNAVQCGLHNICSDSVAALFSYGGALVVRLRAQVLRQIPER >gi568815589f:126732510_126933910|GENSCAN_predicted_CDS_5|156_bp atgagccaacgactgccaatggcctttgccagcaatgctgttcagtgcggtctacacaac atctgctccgactcagtggctgcactgttctcctatggaggtgcactggttgtgcgtctc agagcccaggttctcaggcagataccagaacgctga >gi568815589f:126732510_126933910|GENSCAN_predicted_peptide_6|155_aa MGVAGGGPGVLAAQHPCGRGGRLQMSSEAEAREDRGCSGCALVRPAGRFACLRPYNELVG GLFCCAWPGPGPARRGPGLQAKAPLPLGRSQGHEEAAAATAARVKVTGRDCGGGESGEGK LKLGGKEDLEHVLNNEGISNIQGGEKYFINEKYCM >gi568815589f:126732510_126933910|GENSCAN_predicted_CDS_6|468_bp atgggcgtggcgggcggcgggccgggtgtgctggcagcgcagcatccctgcgggcgggga gggcgtctgcagatgagctcggaggccgaggcccgggaggaccgcggatgcagcggctgc gcactggtacgaccagccggccgatttgcgtgtctccgcccctacaacgagctggttggg gggctgttctgctgcgcctggcccggccccggcccggctcggcgtggccccggcctccaa gcgaaggcgccgctgccgctgggccgctcccagggccatgaggaagcggcggcagccact gcggcccgcgtcaaggtgaccggccgggactgcggcggcggggagagcggcgaaggaaag ctcaaacttggtggcaaagaagacctagaacatgtgttgaataatgaggggattagcaac attcagggtggagagaaatacttcattaatgaaaaatactgcatgtga >gi568815589f:126732510_126933910|GENSCAN_predicted_peptide_7|99_aa XAKGITPRWSVLVLRTPPQVNPDPSHYSSAVAFSNRIQISHWEPTIYSICSSNKHLRSIY YRPDARLGTEDLLVGKTVTAHDLMGLYCTRRDRHIQQLP >gi568815589f:126732510_126933910|GENSCAN_predicted_CDS_7|300_bp naagcaaaaggcatcacaccacgatggtcagtcctggtattaaggactccccctcaagtt aaccctgatccttctcactactcctcagctgtggctttcagcaacaggattcagatcagt cactgggagcccactatttactccatttgttcatccaataaacacttaaggagcatctac tacaggccggatgctaggctaggcactgaggacctgctggtgggcaaaacagtcacggcc catgacctgatgggactgtactgcaccaggagagacagacatatccagcaattaccctag