GENSCAN 1.0 Date run: 5-Nov-116 Time: 10:40:41 Sequence gi568815581r:64649919_64858379 : 208461 bp : 47.22% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4011 4178 168 2 0 82 94 113 0.793 10.83 1.02 Term + 27548 27643 96 1 0 100 39 76 0.737 1.87 1.03 PlyA + 27800 27805 6 1.05 2.00 Prom + 45335 45374 40 -3.36 2.01 Init + 56592 56638 47 2 2 85 99 80 0.950 7.10 2.02 Intr + 63148 63217 70 1 1 112 89 -11 0.145 0.58 2.03 Intr + 100499 100999 501 1 0 14 -95 599 0.336 27.38 2.04 Term + 101045 101392 348 1 0 19 47 473 0.951 30.79 2.05 PlyA + 101658 101663 6 1.05 3.05 PlyA - 101711 101706 6 1.05 3.04 Term - 102038 101871 168 2 0 91 34 74 0.554 0.18 3.03 Intr - 104736 104503 234 0 0 97 92 171 0.941 16.29 3.02 Intr - 108461 108324 138 2 0 21 98 142 0.916 9.16 3.01 Init - 108685 108611 75 1 0 73 79 57 0.891 4.39 3.00 Prom - 114226 114187 40 -1.76 4.12 PlyA - 114899 114894 6 1.05 4.11 Term - 129021 128530 492 0 0 5 48 573 0.518 39.61 4.10 Intr - 129161 129066 96 2 0 102 -44 220 0.531 10.41 4.09 Intr - 131121 130980 142 2 1 129 64 -26 0.231 -0.54 4.08 Intr - 131698 131477 222 0 0 68 59 79 0.028 0.24 4.07 Intr - 136806 136649 158 0 2 64 85 257 0.958 21.81 4.06 Intr - 141123 141039 85 2 1 136 49 15 0.909 2.22 4.05 Intr - 142624 142431 194 1 2 60 83 338 0.945 28.79 4.04 Intr - 147491 147346 146 0 2 97 131 198 0.977 25.00 4.03 Intr - 150999 150082 918 1 0 113 55 1365 0.998 126.93 4.02 Intr - 151215 151184 32 2 2 70 93 18 0.949 -1.73 4.01 Init - 152301 152138 164 0 2 69 80 181 0.950 14.60 4.00 Prom - 154579 154540 40 -6.96 5.11 PlyA - 156218 156213 6 1.05 5.10 Term - 161943 161848 96 0 0 108 32 52 0.502 -0.43 5.09 Intr - 165166 164945 222 1 0 59 117 146 0.923 12.92 5.08 Intr - 165387 165300 88 2 1 74 97 -19 0.065 -2.43 5.07 Intr - 172475 171849 627 1 0 82 70 694 0.078 58.32 5.06 Intr - 174954 174707 248 0 2 77 52 324 0.955 23.96 5.05 Intr - 178453 178357 97 0 1 106 63 9 0.557 0.21 5.04 Intr - 183531 183410 122 0 2 112 59 64 0.390 5.19 5.03 Intr - 187415 187138 278 0 2 95 97 151 0.875 13.94 5.02 Intr - 197067 196981 87 1 0 45 92 82 0.229 4.24 5.01 Intr - 205971 205922 50 2 2 90 94 91 0.943 8.22 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 172475 171845 631 1 1 82 48 698 0.876 58.99 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:64649919_64858379|GENSCAN_predicted_peptide_1|87_aa MNGSQKHYAKRSLTQKTADSMIPFTQNPRKGKIIVTENRSVVDKFSVLKKTTGYKENWTL TWNPAHGLDPTTQLPRFYNLERRNTYG >gi568815581r:64649919_64858379|GENSCAN_predicted_CDS_1|264_bp atgaatggatctcaaaagcattacgctaaaagaagcctcacccaaaagactgcagacagt atgattccatttactcaaaatcctagaaaaggcaaaattatagtgaccgaaaacagatca gtggttgacaagttctcggtattgaagaagacaactggctacaaagagaactggactctc acctggaacccagcccacggcttggatcccaccacccagctgccacgcttctacaatcta gagcggcgtaatacatatggctag >gi568815581r:64649919_64858379|GENSCAN_predicted_peptide_2|321_aa MLFIKLAFPLFIKLPRFSKQILNMSCKSQESAHRETQRRTKVRGLIEIISNAAGYENIPI QHHEENFLRQLAQKVPDKLNNPKFNDPHIKTNLLLQAHLSRMQLSAELQSDTEEILSKAV RLVQACVDVLSSNGWLSSALAAMELAQMVTQAMWSKDSYLKQLPHFISEHIKRCTDKGVK SVFDIMEMEDEERNTILQLTDSQITDEVVDKDSIRSGGPAVVLVQLEREEEVTGPVIAPL FPQKREEGWWVVIGDTKSNSLISIKRLILQQKAKVKLDFVAPATGAHNYTLYFMSDAYMG CDQEYKFSMDVKEAETDSDSD >gi568815581r:64649919_64858379|GENSCAN_predicted_CDS_2|966_bp atgctgttcatcaaactggcctttcccctgttcatcaaactgcccagattctccaagcaa atattaaacatgtcctgcaaatctcaggaatcagcacacagagaaacacaaagaagaacc aaggtgcgagggcttattgagatcatctccaatgcagcagggtacgagaatatccccatc cagcaccatgaagaaaacttcctgaggcagttggctcagaaggtccctgacaagctgaat aaccctaagttcaatgatccgcacatcaagaccaacctgctcctgcaggctcacctgtcc cgcatgcagctgagtgctgagttgcagtcagatacggaggaaatccttagtaaggcagtc cggctcgtccaggcctgcgtggatgtcctctccagcaatgggtggctcagctctgctctg gcagctatggaattggcccagatggtcacccaagccatgtggtccaaggactcatacctg aagcagctgccacacttcatctctgagcatatcaaacgttgcacagacaagggagtgaag agtgttttcgacatcatggagatggaggatgaagaacggaacacaattcttcagctgact gacagccagattacagatgaggtggtggataaggacagcatccgcagtggcgggccagcg gtggtgctggtgcagctggagcgagaggaggaagtcacaggccctgtcattgcgcctctc ttcccgcagaaacgtgaagagggctggtgggtggtgattggagacaccaagtccaatagc ctcatctccatcaagaggctgatcctgcagcagaaggccaaggtgaagttggactttgtg gctccagccactggtgcccacaactacactctatacttcatgagtgatgcttacatggga tgtgaccaggagtacaaattcagcatggatgtgaaagaagccgagacggacagtgattca gattga >gi568815581r:64649919_64858379|GENSCAN_predicted_peptide_3|204_aa MSTNENANTQAAHLHRSKNKGKDSTETRRHRIEVNVELRKAKKDDQMLKRRNVSSFPDDA TSPLQENCNNQTPALRAIGNIVTGTDEQTQVVIDAGALAVFPSLLTNPKTNIQKEATWTM SNITASRQDQIQQVVNHGLVPFLVSDLSKAAEKLGETEKLSIMIEECGGVDKIEALQNHE NESVYKASLSLTEKYFSVEVSNGW >gi568815581r:64649919_64858379|GENSCAN_predicted_CDS_3|615_bp atgtccaccaatgagaatgctaatacacaagctgcccatcttcacagatctaagaacaag ggaaaggacagtacagaaacgaggcgtcacagaatagaggtcaatgtggagctgaggaaa gctaagaaggatgaccagatgctgaagaggagaaatgtaagctcatttcctgatgatgct acttctccgctgcaggaaaactgcaacaaccagactcctgccctaagagccatagggaat atcgtcactggtacagatgaacagactcaggttgtgattgatgcaggagcactcgccgtc tttcccagcctgctcaccaaccccaaaactaacattcagaaggaagctacgtggacaatg tcaaacatcacagccagccgccaggaccagatacagcaagttgtgaatcatggattagtc ccattccttgtcagcgatctctctaaggctgctgagaaactaggtgaaactgagaaactt agtataatgattgaagaatgtggaggcgtagacaaaattgaagctctacaaaaccatgaa aatgagtctgtatataaggcttcattaagcttaactgagaagtatttctctgtagaggtg agtaatggatggtaa >gi568815581r:64649919_64858379|GENSCAN_predicted_peptide_4|882_aa MRTIQEEAGWQQEFILDMSAEILVDMYGTGGYMNVDFKNKVQAGDKNWELKPESSEIQDA RAAVDGLSNPFQGLMKLGTVERQGAMGIWKELFCELSPLEFRLYLSNEEHTCVENCSLLR CESVGPAHSDGRFELVFSGKKLALRASSQDEAEDWLDLVREALQKVRPQQEDEWVNVQYS DQPEEPPEAPQGCLSPSDLLSEPAALQGTQSDWSSAQVPELDAIKESLLYLYMDRTWMPY IFSLPLEALKCFRIRNNEKMLSDSHGVETIRDILPDTSLGGPSFFKIITAKAVLKLQAGN AEEAALWRDLVRKVLASYLETAEEAVTLGGSLDENCQEVLKFATRENGFLLQYLVAIPME KGLDSQGCFCAGCSRQIGFSFVRPKLCAFSGLYYCDICHQDDASVIPARIIHNWDLTKRP ICRQALKFLTQIRAQPLINLQMVNASLYEHVERMHLIGRSREQLKLLGDYLGLCRSGALK ELSKRAGPESASLKTASSILAAVLPEPGLLRARIADEVYEGFLKALIEFASQHVYHCDLC TQRSFICQICQHHDIIFPSEFDTTVSPAPVREEVLPGALRRCLTGYSAIQETWQRRSRAV EEGARDAEPTRACRDKWRRGRRADAPRCPGRQWAAERELRLAKELWPGPLLIQPETLRSG GRPCAGNPSRSFSRPGRVAPGKALLGEENAAAAMAADVEGDVYVLVEHPFEYTGKDGRRH RALSIGGTCGASPAAAPSTCPRSTCASCLRWATLPLTRRQVPPSGPAAPEPLAYDYRFVS AAVAAAPNGPPAEPRGGASSLCGPAQRGAASQRSSLAPGLPACLYLRPTAPVRPAQSLDD LTRAVVSPPAGNLGSSGSFKACSVAGSWVCLLPLSRSDSENV >gi568815581r:64649919_64858379|GENSCAN_predicted_CDS_4|2649_bp atgaggaccattcaggaggaagcaggctggcagcaggaattcatcctggacatgtcagct gagatcctagtggacatgtatggcacaggtggatatatgaatgtggatttcaagaacaag gtccaggctggcgataaaaattgggagctaaagcctgagagcagtgaaatacaggatgca agagcagcagtggacggactgtccaacccattccagggtctcatgaagctgggcaccgtg gagcggcagggggcaatgggcatctggaaggagctcttctgcgagctctccccactggag ttccgcctctacctgagcaacgaggagcacacctgtgtggagaactgctccctgcttcgc tgtgagtctgtggggccagcccacagtgacgggcgctttgagctggtcttctctggcaag aagctggccctgcgcgcctcctcccaggacgaagctgaggactggctggacctggtgcgg gaggccctgcagaaggtccggcctcagcaggaggatgagtgggtgaacgtgcagtactca gaccagcctgaggaaccccccgaggcgccccagggctgcctctctccctcagacctgctc tcggagcccgcggccctccagggcacacagtctgactggtcgtccgcccaggttccagag ctagatgccatcaaggagtccctgctgtacttgtacatggacaggacctggatgccctat atattttctctgcccttggaggctctgaaatgtttccgcatcaggaacaatgagaagatg ctgagtgacagccacggcgtggagaccatccgggacatcctgccagacaccagccttggg ggcccatccttcttcaaaatcatcacggccaaggctgtcctgaagctgcaggccggaaac gccgaggaagccgccctgtggagggatctggtccgcaaagtcctggcatcctacttggag acagccgaggaggcggtgaccctgggcgggagcctggatgaaaactgtcaggaggtgctg aaatttgccacccgggagaatggcttcctgctgcagtacctggtggccatccccatggag aaaggccttgactcccaaggctgcttctgcgcaggctgctcccggcagatcggcttctcc tttgtacgacccaagctctgtgccttctctggcctctattactgtgacatctgccaccaa gacgatgcctcagtgattccggccaggatcatccacaactgggacctcaccaagcgcccg atctgcaggcaggccctgaagtttctgacgcagatccgggcccagcccctcatcaacctg cagatggtgaacgcgtctctgtacgagcatgtggagcggatgcacctcattgggaggagc cgggagcagctgaagctcctgggggattacctgggcctgtgccggagtggcgccctgaag gagctcagcaagagggctggcccagagagcgcctctctgaagactgcctcctccatcctg gctgcagtgctcccagagccgggcctcctcagggccaggatcgcagatgaggtatatgaa ggattcctcaaggccctgattgaatttgcctcccagcatgtctaccactgcgacctgtgc acccagcgcagcttcatctgccagatctgccagcaccacgacatcatcttcccctctgag tttgacaccacagtcagcccagccccagtccgggaggaagttctgccgggcgctctccgc cggtgcctcactggttatagtgctatacaggaaacctggcagcgccggagccgcgcggtc gaggagggagcgcgggacgccgagcccacgcgcgcctgccgggacaagtggagacgaggc cggcgagcggacgccccgaggtgcccgggcaggcagtgggccgccgagcgggagttgagg ttggcgaaggagctgtggccgggtccccttcttatccagcccgagacactgcgctccggt gggcgtccctgcgctgggaatccctctcggagtttttccaggcccggccgggttgctccg ggaaaggccctgttgggggaggaaaatgccgcggccgcaatggcggcggatgtggagggg gacgtgtacgtactggtggagcaccccttcgagtacaccggcaaggacgggcgccgccac agagcactgagcattggtggcacgtgcggcgcaagcccggcggccgccccttctacctgc ccgcgcagtacgtgcgcgagctgcctgcgctgggcaaccctgccgctgacgcgccgccag gtccccccatcgggacccgcagcccctgagccgctcgcctacgactaccggttcgtgagc gcggctgtggcagcggcccccaacggccccccagcggagccccgaggcggggccagctcc ctgtgcggccctgcgcagcgcggcgccgcgagccagcgcagcagcctggcgcccggcctg cccgcctgcctgtacctgcggcccacggcgcccgtgcggcccgcgcagtccctggacgac ctgacgcgcgccgtcgtctcgcctcccgccggcaacctcggaagcagcggcagcttcaag gcctgcagcgtagcgggctcctgggtgtgcctgctgcccctgtcgcgcagcgactcagag aacgtctaa >gi568815581r:64649919_64858379|GENSCAN_predicted_peptide_5|638_aa XSVVTEGHYKKMKKDSQELSDTKSRIYCHPKGPPEERGYTGSNTKAGAPSSPASQTSHLP RGSAPRDFREQLAPPQGSFGLRDWHRGPGAGPERRANVMASGAGALLPVVSGRETASSAR RLRGCRSREPPSPRSSLRLVGLLEATPLFTLVLCVNEERFEDRKWNQFHRMVDSESGPPG ACRIPPQDPPELGQKPGQPSLLTTAAQTDSMRVIKKKLVGSVKALQKQYVSLDMVVTSED GDANTMCSALEAVFIHGLHTKHIRAEAGGKRKKSAHQKPLPQPVFWPLLKAVTPKHIISE LEHLTFVNMDVGRCRAWLRLALNNGLMECYLKLLLQEQARLREYYQPTALLRDAEEGEFL LSFLQGLMSLSFELSYKSAILNEWTLTPLALSGLCPLSELDPLSTSGAELQRKESLDSIS HSSGSEDIEVHHSGHKIRRNQKPTASSLSLDTASSSQLSCSLNSDSCLLQENGSKSPDHC EEPMSYDSDLGTANAEDSDQSLQDCLGHYRRQPVTQVSLFKTSMVLLEFSKAQAASGTQD GVHVQEPHPQAPSPLDLQQPVESTSGQQPSSTVSETAREVGQGNGLQKAQAHDGAGLKLV VSSPTSPNISSMIQSTRPVHFCIVSTKHCTWEKVGMDE >gi568815581r:64649919_64858379|GENSCAN_predicted_CDS_5|1917_bp nnatctgttgtcaccgaaggtcattacaagaagatgaagaaggattctcaagagctgtca gacaccaaatcccgtatctactgccacccaaagggacctccagaagaaaggggttataca gggtcaaacaccaaggcaggcgcccccagttccccagcctcccagacctcgcatctgccc cggggctcagctcccagggacttccgggagcagctggccccgccccaaggctccttcggg ctgcgcgattggcaccgcgggccgggggcagggccggagcgccgagccaacgtgatggcg tcaggggccggggcgctgcttcctgttgtcagtggccgagagaccgcatcgtcggctcgg aggctgaggggctgccgcagccgggagcccccctcgcctcgctcctcgctccgcttggta ggtctccttgaagccacacctctcttcactttggttctttgtgtcaatgaagagcgtttt gaggacagaaagtggaaccagttccataggatggtagattcagaatctgggccaccaggt gcctgccgaatcccaccccaggaccctccagagctagggcagaagcctggccaacccagc cttctaacaacagctgcacaaactgactccatgagggtcatcaagaagaagctggtggga tccgtgaaagccttgcaaaagcagtacgtgtccctggacatggtggtcactagtgaagac ggagatgccaacaccatgtgcagcgccctggaggccgtatttatccatggcctgcacacc aagcacatccgagctgaggccggaggaaaaaggaagaaaagtgcccaccagaagcctctg ccccagcctgtcttctggcccctcctgaaagctgtcacccccaaacacatcatctcagag ttggagcacctgacgtttgtcaacatggatgtgggccgctgccgggcatggctgcggctg gccctgaacaatggcctgatggagtgctacctgaagctgctgctgcaggagcaggcccgc ttgcgtgagtactaccagcccaccgccctgctccgggatgctgaggagggcgagttcctc cttagcttcctgcagggcctcatgtccttgtccttcgaactctcctacaagtctgccatc ttaaatgagtggacgctcaccccactggccctgtctgggctttgcccgctttctgagctg gaccctctctctacctctggtgcagaactacagcggaaggaatctctggattccatttcc cattcttcaggctctgaagacatcgaagtccatcactcgggccataagatacggaggaac cagaagccgactgcctcctccctcagcctggacacggccagttcatcccagctgtcctgc agcctaaactctgatagctgcttactccaagagaatggctccaagagtccagaccattgc gaggagcccatgtcctatgactcagacctgggcacagcaaatgctgaggactcagaccag tctctgcaagattgccttgggcattatcgcagacaacctgtcacacaggtttctttgttt aaaacttctatggtattgttggaattcagcaaagcccaggcagcctctggaactcaagat ggtgtccacgtgcaggagccgcatccccaggcgcccagccccctggacttacagcagcct gtagagagcacctcaggccagcagccttctagtactgtcagcgagacagccagagaagtg ggccaagggaatggcctgcagaaggcccaggctcatgacggagctggtctgaagctggta gtttcctcacccaccagtccgaatattagctctatgatacagagcaccaggcctgttcac ttctgtattgtcagcaccaagcactgtacctgggagaaagtaggcatggatgaatag