GENSCAN 1.0 Date run: 4-Nov-116 Time: 13:55:29 Sequence gi568815577f:36599747_36845270 : 245524 bp : 46.49% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 20496 20751 256 2 1 55 40 183 0.624 7.59 1.02 Term + 20994 21721 728 1 2 28 49 291 0.332 12.84 1.03 PlyA + 21763 21768 6 -1.75 2.00 Prom + 21803 21842 40 -1.86 2.01 Init + 25354 25411 58 0 1 84 53 50 0.628 2.57 2.02 Term + 27264 27367 104 1 2 108 44 145 0.805 10.54 2.03 PlyA + 28019 28024 6 -0.45 3.11 PlyA - 28230 28225 6 1.05 3.10 Term - 31139 31115 25 1 1 71 49 44 0.164 -3.50 3.09 Intr - 33376 33228 149 1 2 100 78 80 0.652 7.23 3.08 Intr - 40380 40260 121 2 1 51 27 118 0.263 2.50 3.07 Intr - 41126 41030 97 0 1 82 74 37 0.152 0.77 3.06 Intr - 43264 43151 114 2 0 104 57 41 0.369 3.02 3.05 Intr - 54152 54014 139 2 1 -12 70 86 0.008 -3.16 3.04 Intr - 66511 66414 98 2 2 85 99 75 0.772 8.03 3.03 Intr - 71826 71733 94 0 1 119 100 43 0.865 8.14 3.02 Intr - 81418 81356 63 1 0 76 100 32 0.381 2.11 3.01 Init - 86516 86511 6 2 0 81 89 4 0.243 0.54 3.00 Prom - 89719 89680 40 -5.66 4.00 Prom + 92136 92175 40 -4.66 4.01 Init + 95210 95359 150 1 0 59 64 81 0.113 2.85 4.02 Intr + 97568 97622 55 1 1 53 90 61 0.187 1.35 4.03 Intr + 98168 98244 77 0 2 79 116 -3 0.279 0.83 4.04 Intr + 98905 99028 124 0 1 96 20 119 0.317 6.06 4.05 Intr + 99336 99633 298 1 1 73 57 139 0.056 5.23 4.06 Intr + 99991 100175 185 1 2 -52 100 347 0.035 21.33 4.07 Intr + 101097 101179 83 1 2 65 43 27 0.382 -4.74 4.08 Intr + 105121 105433 313 0 1 85 36 226 0.947 12.86 4.09 Intr + 107373 107639 267 1 0 43 72 129 0.275 4.30 4.10 Intr + 109422 109504 83 1 2 101 80 61 0.906 5.96 4.11 Intr + 112787 112876 90 1 0 97 94 12 0.856 2.89 4.12 Intr + 119830 119988 159 0 0 57 84 38 0.516 0.48 4.13 Intr + 120075 120183 109 2 1 99 61 132 0.964 11.46 4.14 Intr + 123299 123384 86 0 2 104 102 14 0.982 3.94 4.15 Intr + 126373 126572 200 0 2 116 109 287 0.993 31.75 4.16 Intr + 131299 131405 107 1 2 101 75 223 0.967 22.16 4.17 Intr + 141971 142118 148 0 1 81 113 213 0.996 22.39 4.18 Intr + 143641 143809 169 1 1 113 92 98 0.998 12.65 4.19 Intr + 144919 145390 472 0 1 46 98 353 0.487 24.75 4.20 Term + 147919 148346 428 2 2 121 46 611 0.985 55.67 4.21 PlyA + 149221 149226 6 1.05 5.05 PlyA - 149589 149584 6 1.05 5.04 Term - 154671 154500 172 2 1 119 38 216 0.896 17.00 5.03 Intr - 157009 156796 214 2 1 85 82 294 0.999 26.27 5.02 Intr - 160095 159981 115 0 1 80 111 76 0.999 9.12 5.01 Init - 160425 160378 48 0 0 52 121 32 0.448 3.85 5.00 Prom - 163981 163942 40 -1.46 6.00 Prom + 177006 177045 40 -0.96 6.01 Init + 182806 182946 141 0 0 76 61 55 0.224 1.73 6.02 Intr + 186205 186274 70 0 1 135 44 52 0.201 4.25 6.03 Term + 195455 195576 122 0 2 76 42 153 0.492 8.14 6.04 PlyA + 196087 196092 6 1.05 7.00 Prom + 209165 209204 40 -3.46 7.01 Sngl + 220804 221256 453 0 0 50 55 244 0.300 11.52 7.02 PlyA + 222774 222779 6 1.05 8.02 PlyA - 222787 222782 6 1.05 8.01 Term - 236481 236297 185 1 2 93 49 101 0.662 4.41 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577f:36599747_36845270|GENSCAN_predicted_peptide_1|327_aa MNIDAKILNKILANRIQQHIKRLIHHDQVGFIPGMQGWFNIHKSINVIQHINRTNDKNHM IISVDAEKAFDKIQQPFMLKTLNKLDDMIVYLENPIVSAQSLLKLISNFSKVSGYKINVQ KSQAFLYTYNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPMLIEIKEDT NKWKNIPCSWIGRINIVKMAILPKVIYRFNAMPIKLPMAFFTELEKTTLKFIWNQKRARI AKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHVYNYLI FDKPDKNKKWGKDSTFNKRCWENWLAI >gi568815577f:36599747_36845270|GENSCAN_predicted_CDS_1|984_bp atgaacatcgatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatc aaaaggcttatccaccatgatcaagtgggcttcatccctggaatgcaaggctggttcaac atacacaaatcaataaatgtaatccagcatataaacagaaccaatgacaaaaaccacatg attatctcagtagatgcagaaaaggcctttgacaaaattcaacagcccttcatgctaaaa actctcaataaattagatgacatgattgtatatctggaaaaccccatcgtctcagcccaa agtctccttaagctgataagcaacttcagcaaagtctcaggatataaaattaatgtacaa aaatcacaagcattcttatacacttataacagacaaacagagagccaaatcatgagtgaa ctcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagggat gtgaaggacctcttcaaggagaactacaaaccaatgctcattgaaataaaagaggataca aacaaatggaagaacattccatgctcatggataggaagaatcaatatcgtgaaaatggcc atactgcccaaggtaatttatagattcaatgccatgcccatcaagctaccaatggctttc ttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcatt gccaagtcaatcctgagccaaaagaacaaagctggaggcatcacgctacctgacttcaaa ctatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatata gaccaatggaacagaacagagccctcagaaataatgccgcatgtctacaactatctgatc tttgacaaacctgacaaaaacaagaaatggggaaaggattccacatttaacaaaaggtgc tgggaaaactggctagccatatga >gi568815577f:36599747_36845270|GENSCAN_predicted_peptide_2|53_aa MTVSFLRLPQPYFLYSLQNNQTTFSSYAAPGMNASEALQLLEADIGLKEPMSP >gi568815577f:36599747_36845270|GENSCAN_predicted_CDS_2|162_bp atgactgtaagcttcctgaggcttccacagccatacttcctgtacagcctgcagaacaac caaacaaccttcagctcctatgcggcacctggcatgaatgcttctgaggctctgcagctg ctggaggcggacattggcctaaaggagcccatgagtccatga >gi568815577f:36599747_36845270|GENSCAN_predicted_peptide_3|301_aa MVALNDDFYIFSTIRLVWFPGAEAQLSFVCVCLFHGIPEDFHLQEYVMSEILLGTGSSAL SLSGSPGQRGNIQYSEGEGKQQTFRDQRHGQLRLKEPSRQNIKGRRPLIETNGGTIKGRR PVRCGLDVFKDQKDRRQLECRLPSAPGWDPVFKCNSQASCPLLCVLLDCALAWIKGGPHI VQQSRWERLAPWDPASWPRSADMGWKHYSNESCETDGLWPRLLNTDTLLINTQHPDILIS AQILGVVAAGASSAWPGSPIAHSCQCAHAQTCTIPVQPLQLPPGIFSQDLEAVRKGELMR P >gi568815577f:36599747_36845270|GENSCAN_predicted_CDS_3|906_bp atggtggccctgaatgatgatttttatatcttcagcaccattagacttgtatggtttcct ggagctgaagctcagctttcctttgtctgtgtctgtctatttcatggcatccctgaagac ttccacctccaggaatatgtcatgtctgagatcttgttgggaactggatccagtgccctg agcctctctggttcacctggtcagagaggaaatatccagtattctgaaggggagggcaag cagcagacattcagagatcagcggcatggacagctgaggctgaaagagccctccaggcag aacatcaagggcagaaggcccttgatagagaccaatggagggaccatcaagggcagaagg ccagttaggtgtgggcttgacgtcttcaaggaccagaaagatagacgccagctcgaatgc cgcctcccgtctgctcctggttgggatcctgtcttcaagtgcaacagccaagccagctgc ccactcctctgtgtcctcttagattgtgctctggcttggataaagggtggaccacacatt gttcagcagagcagatgggagcgcctggcgccctgggatcccgcatcctggccgcggtct gccgacatgggttggaagcattattcaaatgagtcctgtgaaactgatggcctgtggccc cggttgttaaacaccgacacgctgctcattaacactcagcatccggatatcctaatttca gctcagattttgggtgtggtggcagcaggtgcctcttctgcatggccagggtcccccatc gctcacagctgtcagtgtgcacatgctcagacctgcaccattcctgtccagcccctgcag ttaccacctggaatcttcagccaggacttggaggctgtgaggaaaggcgagctcatgcga ccctag >gi568815577f:36599747_36845270|GENSCAN_predicted_peptide_4|1200_aa MQILGQETLPEDPLSLPIPEPEVIPDNQVQAHEPRRSASGSSLPPQAAQERPGAGSCQRS QERDPGRGGHAPHCSTALQPVAHGRESPHDLDAKLLDLQHREGEDAGPAHPRKARSSPAG RLANESRRLLSAAPAGGQCSARRRVCHKQTRLGRTWTAEVLRLATHRGLRRCVSRHREPP GRAGERARARARPQPPATRSPPLGSRSMAPRPPPPLSPSGPEGPCRSRFEHSTKDRGAMK EKSKNAAKTRREKENGEFYELAKLLPLPSAITSQLDKASIIRLTTSYLKMRAVFPEGLKM CQWARGAGTRGAEDSEQEEIVARWFGKRRKKRAFQGPRGEHEGSDAGEGLKRAEAPQIAT IYWDPFVGKGGLEAQAIGCPRATRREPGPKLPPYALAQVPGDWCSQGSPLSLPPLQGVVL RLLGHKRRGQRWLQDNCPTPGTTQQQRKGQLPGCLCRFLEIRIPGSLVKSYFTKSQDLHI LVLAIANSCKCCRHEAWEEGPAGESTGSAAVITVVMRGLGDAWGQPSRAGPLDGVAKELG SHLLQTLDGFVFVVASDGKIMYISETASVHLGLSQGSLGCKLKPQCHGLFRQCGRQFRAL SSVEKRKLFLRLAKVWVVSRDQSSVPGRVELTGNSIYEYIHPSDHDEMTAVLTAHQPLHH HLLQEYEIERSFFLRMKCVLAKRNAGLTCSGYKVIHCSGYLKIRQYMLDMSLYDSCYQIV GLVAVGQSLPPSAITEIKLYSNMFMFRASLDLKLIFLDSRVTEVTGYEPQDLIEKTLYHH VHGCDVFHLRYAHHLLLVKGQVTTKYYRLLSKRGGWVWVQSYATVVHNSRSSRPHCIVSV NYVLTEIEYKELQLSLEQVSTAKSQDSWRTALSTSQETRKLVKPKNTKMKTKLRTNPYPP QGRKAASCRPLAGPSSIFFAMQQYSSFQMDKLECGQLGNWRASPPASAAAPPELQPHSES SDLLYTPSYSLPFSYHYGHFPLDSHVFSSKKPMLPAKFGQPQGSPCEVARFFLSTLPASG ECQWHYANPLVPSSSSPAKNPPEPPANTARHSLVPSYEAPAAAVRRFGEDTAPPSFPSCG HYREEPALGPAKAARQAARDGARLALARAAPECCAPPTPEAPGAPAQLPFVLLNYHRVLA RRGPLGGAAPAASGLACAPGGPEAATGALRLRHPSPAATSPPGAPLPHYLGASVIITNGR >gi568815577f:36599747_36845270|GENSCAN_predicted_CDS_4|3603_bp atgcaaatcctgggacaggagacactgcctgaggaccctctctcactcccaatcccagaa cccgaagttatccccgacaaccaagtccaagcacatgaaccaagacgatcagcttcaggc agctccttacccccacaagcggcccaggagaggcccggagccggcagctgtcagcgcagc caggagcgggatcctgggcgcggaggtcacgcaccccactgctccacggctctgcagcct gtggcacacggccgagagtccccacatgatctcgacgccaagctcttggacctgcaacac cgggagggcgaggacgcgggaccagcgcaccctcggaaggctcgatcctccccggcaggg cgcctggccaacgagtcgcgccgcctcctctcggccgcgcctgctggcggccagtgctcc gcccgaaggcgggtctgccataaacaaacgcggctcggccgcacgtggacagcggaggtg ctgcgcctagccacacatcgcgggctccggcgctgcgtctccaggcacagggagccgcca ggaagggcaggagagcgcgcccgggccagggcccggccccagccgcctgcgactcgctcc cctccgctgggctcccgctccatggctccgcggccaccgccgcccctgtcgccctccggt ccggaggggccttgccgcagccggttcgagcactcgacgaaggaccgaggcgcgatgaag gagaagtccaagaatgcggccaagaccaggagggagaaggaaaatggcgagttttacgag cttgccaagctgctcccgctgccgtcggccatcacttcgcagctggacaaagcgtccatc atccgcctcaccacgagctacctgaagatgcgcgccgtcttccccgaaggattgaaaatg tgccagtgggccaggggcgctgggacccgcggtgcggaagactcggaacaggaagaaata gtggcgcgctggtttggaaaaaggcgcaagaagcgggcttttcagggaccccggggagaa cacgagggctccgacgcgggagaaggattgaagcgtgcagaggcgccccaaattgcgaca atttactgggatccttttgtggggaaaggaggcttagaggctcaagctataggctgtcct agagcaactaggcgagaacctggccccaaactccctccttacgccctggcacaggttccc ggcgactggtgttcccaagggagccccctgagcctaccgcccttgcagggggtcgtgctg cggcttctgggtcataaacgccgagggcaaaggtggctccaggacaactgcccaacccca ggaacgacccagcagcagagaaaaggacagctgccagggtgcctttgtcgctttttggaa atcagaattcctgggtccttagttaagtcttacttcaccaaatcccaggaccttcacatt ttggttcttgccattgctaacagttgtaaatgctgccgccacgaggcctgggaggaagga cccgctggtgagagcacagggagtgctgctgtgatcacggtggtgatgcggggtttagga gacgcgtggggacagccgagccgcgccgggcccctggacggcgtcgccaaggagctggga tcgcacttgctgcagactttggatggatttgtttttgtggtagcatctgatggcaaaatc atgtatatatccgagaccgcttctgtccatttaggcttatcccagggttctcttggctgc aagctcaagcctcagtgccatggcctttttagacagtgtgggagacagttcagagctctg agctctgttgagaaaagaaaactgtttctgcgtttggcaaaggtgtgggttgtcagcaga gatcaaagttctgttcctggcagggtggagctcacgggcaacagtatttatgaatacatc catccttctgaccacgatgagatgaccgctgtcctcacggcccaccagccgctgcaccac cacctgctccaagagtatgagatagagaggtcgttctttcttcgaatgaaatgtgtcttg gcgaaaaggaacgcgggcctgacctgcagcggatacaaggtcatccactgcagtggctac ttgaagatcaggcagtatatgctggacatgtccctgtacgactcctgctaccagattgtg gggctggtggccgtgggccagtcgctgccacccagtgccatcaccgagatcaagctgtac agtaacatgttcatgttcagggccagccttgacctgaagctgatattcctggattccagg gtgaccgaggtgacggggtacgagccgcaggacctgatcgagaagaccctataccatcac gtgcacggctgcgacgtgttccacctccgctacgcacaccacctcctgttggtgaagggc caggtcaccaccaagtactaccggctgctgtccaagcggggcggctgggtgtgggtgcag agctacgccaccgtggtgcacaacagccgctcgtcccggccccactgcatcgtgagtgtc aattatgtactcacggagattgaatacaaggaacttcagctgtccctggagcaggtgtcc actgccaagtcccaggactcctggaggaccgccttgtctacctcacaagaaactaggaaa ttagtgaaacccaaaaataccaagatgaagacaaagctgagaacaaacccttacccccca cagggccggaaggcagcttcctgccggcccctcgctggcccttcatccatcttctttgcc atgcagcaatacagctcgttccaaatggacaaactggaatgcggccagctcggaaactgg agagccagtccccctgcaagcgctgctgctcctccagaactgcagccccactcagaaagc agtgaccttctgtacacgccatcctacagcctgcccttctcctaccattacggacacttc cctctggactctcacgtcttcagcagcaaaaagccaatgttgccggccaagttcgggcag ccccaaggatccccttgtgaggtggcacgctttttcctgagcacactgccagccagcggt gaatgccagtggcattatgccaaccccctagtgcctagcagctcgtctccagctaaaaat cctccagagccaccggcgaacactgctaggcacagcctggtgccaagctacgaagcgccc gccgccgccgtgcgcaggttcggcgaggacaccgcgcccccgagcttcccgagctgcggc cactaccgcgaggagcccgcgctgggcccggccaaagccgcccgccaggccgcccgggac ggggcgcggctggcgctggcccgcgcggcacccgagtgctgcgcgcccccgacccccgag gccccgggcgcgccggcgcagctgcccttcgtgctgctcaactaccaccgcgtgctggcc cggcgcggaccgctggggggcgccgcacccgccgcctccggcctggcctgcgctcccggc ggccccgaggcggcgaccggcgcgctgcggctccggcacccgagccccgccgccacctcc ccgcccggcgcgcccctgccgcactacctgggcgcctcggtcatcatcaccaacgggagg tga >gi568815577f:36599747_36845270|GENSCAN_predicted_peptide_5|182_aa MVVVVVDTDIMNFIYKDINLRVKWPNDIYYSDLMKIGGVLVNSTLMGETFYILIGCGFNV TNSNPTICINDLITEYNKQHKAELKPLRADYLIARVVTVLEKLIKEFQDKGPNSVLPLYY RYWVHSGQQVHLGSAEGPKVSIVGLDDSGFLQVHQEGGEVVTVHPDGNSFDMLRNLILPK RR >gi568815577f:36599747_36845270|GENSCAN_predicted_CDS_5|549_bp atggtggtggtggtggtggatactgacataatgaacttcatttacaaggatatcaactta cgagtgaagtggcccaacgatatttattacagtgacctcatgaagatcggcggagttctg gttaactcaacactcatgggagaaacattttatatacttattggctgtggatttaatgtg actaacagtaaccctaccatctgcatcaacgacctcatcacagaatacaataaacaacac aaggcagaactgaagcccttaagagccgattatctcatcgccagagtcgtgactgtgctg gagaaactgatcaaagagtttcaggacaaagggcccaacagcgtccttcccctttattac cgatactgggtccacagtggtcagcaagtccatctgggcagcgcagagggaccaaaggtg tccatcgttggcctggacgattctggcttcctccaggttcaccaggagggcggcgaggtt gtgactgtgcacccggacggcaactccttcgacatgctgagaaacctcatcctccccaaa cggcggtaa >gi568815577f:36599747_36845270|GENSCAN_predicted_peptide_6|110_aa MTRPLNIPQNVFHGEGVEENREHYRTAKENTLRLSSTISIKISETGQDSELDPVSVCGPE SCRYRLQAQHANTKHGDQAMSSHWNRKKEKKGNALRLLRGHIDGAQLASI >gi568815577f:36599747_36845270|GENSCAN_predicted_CDS_6|333_bp atgaccaggcctctgaacattccacaaaacgtgttccatggagagggtgtagaggagaat agggagcattaccgaacagcaaaggaaaacactttaagactctcctcaaccatttccatt aaaatttcagaaacaggccaggattctgagttggatcctgtgtctgtctgtggccctgag agctgcaggtatcgactacaggctcagcatgcaaacacgaagcatggtgaccaagccatg tcatcacactggaaccgcaaaaaagagaagaaaggaaatgcgcttcgtcttctgcgagga cacattgacggtgctcagctagccagcatctag >gi568815577f:36599747_36845270|GENSCAN_predicted_peptide_7|150_aa MLRAALTCQPPAASVPTGLWAPTSTGFWDAKGALRAVWHRPVRALQHEQPGCCGHRGCQV DGCRRQTGSWAESGTSLVKLHLQVKDNLKHFLGLDSFCLFDDPKSTEKPEVSSVGGDPVE GKYKPESRMLSADWEATSWSGLVPALAPGT >gi568815577f:36599747_36845270|GENSCAN_predicted_CDS_7|453_bp atgctcagggcggcgctgacatgccagccccctgctgcctcagtccccactggactttgg gcaccaacaagcacaggcttctgggatgccaagggggcgctgagggcagtttggcacagg cctgtacgcgcccttcagcatgaacagcctgggtgctgtgggcaccgtggatgtcaggtt gatggctgcagaaggcagacaggctcctgggcagaaagtgggacgtccctggtgaagctc catcttcaagtcaaggacaacctgaagcatttcttggggctggatagcttttgtcttttc gatgaccccaaaagcacggaaaaaccagaagtcagctccgtgggtggtgaccccgtggag gggaaatataaaccagaaagtaggatgctgagtgcagactgggaggcgacttcatggtct ggactagtgccagccctagcgcctgggacgtga >gi568815577f:36599747_36845270|GENSCAN_predicted_peptide_8|61_aa XGEVTVEGHRPLPVWSCVSGAVFQNSFVTVTRRFSRGTFVGLCYEYKQTVIRMKVRTWHW G >gi568815577f:36599747_36845270|GENSCAN_predicted_CDS_8|186_bp ntgggagaggtgactgtggaaggacatcggcccttaccagtatggagctgtgtaagtggg gccgtgttccagaatagttttgtcacagtgactaggagattttctcgggggacttttgtg ggactctgctacgaatacaagcaaaccgtaataagaatgaaggttcggacgtggcactgg ggatag