GENSCAN 1.0 Date run: 5-Nov-116 Time: 15:27:33 Sequence gi568815587r:8821319_9032308 : 210990 bp : 43.64% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 6961 7140 180 0 0 101 20 109 0.498 5.46 1.02 Intr + 7417 7580 164 0 2 45 69 130 0.481 5.57 1.03 Intr + 16414 16554 141 1 0 40 72 112 0.103 4.27 1.04 Intr + 26341 26410 70 1 1 56 83 51 0.007 0.58 1.05 Intr + 46819 46884 66 0 0 54 98 33 0.000 0.00 1.06 Term + 71379 72311 933 2 0 -8 48 280 0.002 6.53 1.07 PlyA + 72343 72348 6 1.05 2.00 Prom + 73609 73648 40 -2.46 2.01 Init + 90132 90353 222 2 0 103 78 136 0.630 12.67 2.02 Intr + 93508 93612 105 0 0 53 111 41 0.890 3.31 2.03 Intr + 98019 98156 138 2 0 128 -14 153 0.015 9.76 2.04 Intr + 104327 104461 135 1 0 94 33 75 0.002 3.36 2.05 Intr + 110728 110844 117 0 0 105 105 33 0.440 7.36 2.06 Term + 111460 111534 75 0 0 60 55 71 0.254 -1.16 2.07 PlyA + 113294 113299 6 1.05 3.03 PlyA - 115297 115292 6 -0.45 3.02 Term - 116855 116298 558 2 0 73 44 464 0.339 34.85 3.01 Init - 124071 123841 231 0 0 63 47 86 0.170 0.26 3.00 Prom - 124561 124522 40 -5.56 4.06 PlyA - 124720 124715 6 1.05 4.05 Term - 127157 127002 156 2 0 58 32 194 0.992 8.73 4.04 Intr - 132019 131885 135 1 0 54 98 19 0.525 0.26 4.03 Intr - 134980 134872 109 0 1 79 93 93 0.884 9.19 4.02 Intr - 140865 140774 92 0 2 70 111 4 0.831 -0.31 4.01 Init - 142995 142891 105 0 0 98 113 274 0.999 29.12 4.00 Prom - 149410 149371 40 -7.36 5.16 PlyA - 149454 149449 6 1.05 5.15 Term - 151268 151205 64 1 1 102 42 72 0.220 1.36 5.14 Intr - 162651 162557 95 0 2 43 89 103 0.570 4.66 5.13 Intr - 162806 162754 53 0 2 88 116 26 0.996 4.03 5.12 Intr - 164532 164393 140 2 2 45 99 101 0.997 7.01 5.11 Intr - 166312 166230 83 1 2 88 98 18 0.975 1.24 5.10 Intr - 166943 166800 144 2 0 55 95 101 0.447 7.98 5.09 Intr - 182604 182444 161 1 2 34 89 366 0.361 31.01 5.08 Intr - 183253 183031 223 1 1 68 99 149 0.105 11.70 5.07 Intr - 198753 198627 127 2 1 97 28 27 0.002 -1.72 5.06 Intr - 199879 199749 131 1 2 87 7 106 0.177 1.89 5.05 Intr - 200637 200558 80 1 2 46 113 80 0.914 5.57 5.04 Intr - 201438 201394 45 1 0 80 107 9 0.518 0.38 5.03 Intr - 204536 204384 153 0 0 92 103 192 0.998 21.14 5.02 Intr - 206243 206046 198 0 0 80 38 329 0.979 26.42 5.01 Intr - 209607 209440 168 1 0 92 105 73 0.980 9.32 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 49747 49636 112 1 1 48 99 145 0.843 9.97 S.002 Term + 98019 98162 144 2 0 128 41 150 0.983 12.21 S.003 Init + 196554 196614 61 2 1 91 80 100 0.890 8.92 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:8821319_9032308|GENSCAN_predicted_peptide_1|517_aa KCLDTEQCTFNSTTRPLRLHQNSFRSPALGKAKPPAPQDAALLWFVFRGSNAAVSAGKLG GTLTKQKIPRCFAAVMSEFTQEGKLLQQRKLIRPDKSGEQRWKTGMRQETVEDKKVQEFE HSHEQNQQSINVELGHVSKHRRSLEATTSKDQSPHDGEIKARMADIDSQGNETDYASAGC KRNRNSSPVVTPTSSLPSEPSYSLQGTEEVESLNRPITGSEIEAIINSLPTKKSPGPNGF TVEFYQRYKEELVPFLLKLFQSIEKEGILPNPFYEASIILIPKPGRDTTKKENFRPISLV NINAKILSKILANRIQQHIEKLIHHDQVGFIPGMQGWFNIRKSINIIQHINRTKDKNHMT ISIDAEKAFDKIQQPFMLKTLNKLGIDGTYLKIIRATYDKPTANIILNGQKLEAFPLKTG TRQGCPLSPLLFNIVLKVLARAIRQEKEINGIQLGKEEVKLSLFADDMIVCLENPIVSAQ NLLKLISNFSKVSGYKINVQKSQHSYTPITDKQRAKS >gi568815587r:8821319_9032308|GENSCAN_predicted_CDS_1|1554_bp aagtgcctggacacagagcagtgcaccttcaacagcacaactcggccgttaaggctgcac caaaattcattcaggagcccagccctgggcaaagcaaagcctcctgccccccaggatgca gcacttctctggtttgtcttcaggggctccaatgcagctgtatctgcggggaagctgggg ggaacactgaccaaacagaagattccacgctgctttgctgcggtcatgtctgagtttacc caagaggggaagctgctccaacagaggaagttaattagaccagacaagagcggggagcag aggtggaagacagggatgagacaagaaactgtggaggacaaaaaggtacaagaatttgaa catagccatgaacagaaccaacaatcaataaatgtagagcttggacatgttagcaaacat cgaaggtctctggaagctacaacttccaaggaccaatctccgcatgatggagaaatcaaa gccagaatggctgatattgattctcaaggcaatgaaacagactacgcaagtgctggctgc aaaagaaacagaaacagcagccccgtggtcactccaactagctctttacccagtgaacca agctactcgctgcagggtacagaagaagttgaatccctgaatagaccaataacaggctct gaaattgaggcaataattaatagcttaccaaccaaaaaaagtccaggaccaaatggattc acagtcgaattctaccagaggtacaaggaggagctggtaccattccttctgaaactattc caatcaatagaaaaagagggaatcctccctaacccattttatgaggccagcatcattctg ataccaaagcctggcagagacacaacaaaaaaagagaattttagaccaatatccctagtg aacatcaatgcaaaaatcctcagtaaaatactggcaaaccgaatccagcagcacatcgaa aagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaacata cgcaaatcaataaacataatccagcatataaacagaaccaaagacaaaaaccacatgact atctcaatagatgcagaaaaggcctttgacaaaattcaacagcctttcatgctaaaaact ctcaataaattaggtattgatgggacgtatctcaaaataataagagctacttatgacaaa cccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaaccggc acaagacagggatgccctctctcaccactcctattcaacatagtgttgaaagttctggcc agggcaatcaggcaggagaaagaaataaatggtattcaattaggaaaagaagaagtcaaa ttgtccctgtttgcagatgacatgattgtatgtttagaaaaccccatcgtctcagcccaa aatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtgcaa aaatcacagcattcttatacaccaataacagacaaacagagagccaaatcatga >gi568815587r:8821319_9032308|GENSCAN_predicted_peptide_2|263_aa MDNCLAAAALNGVDRRSLQRSARLALEVLERAKRRAVDWHALERPKGCMGVLAREAPHLE KQPAAGPQRVLPGEKYYSSVPEEGGATHVYRYHRGESKLHMCLDIGNGQAENISKDLYIE VYPGTYSVTVGSNDLTKKTHVVAVDSGQSVDLVFPEEDEEEETARGACIATFSSLGPSRG KSCAVGSNSGSSLAMTSEVLEASLSTKVYHQQPRTKEKEKGIHFPQRGRDRYLARGLCPP CSEETAATRAQDQAALLQPTRAP >gi568815587r:8821319_9032308|GENSCAN_predicted_CDS_2|792_bp atggacaactgtttggcggccgcagcgctgaatggggtggaccgacgttccctgcagcgt tcagcaaggctggctctagaagtgctggagagggccaagaggagggcggtggactggcat gccctggagcgtcccaaaggctgcatgggggtccttgcccgggaggcgccccacctagag aaacagccggcagccggcccgcagcgcgttctcccgggagagaaatattattcatctgtg ccagaggaaggaggggcaacccatgtctatcgttatcacagaggcgagtcgaagctgcac atgtgcttggacatagggaatggtcaggctgagaacatctctaaggacctctacatagaa gtatatccagggacctattctgtcactgtgggctcaaatgacttaaccaagaagactcat gtggtagcagttgattctggacaaagcgtggacctggtcttccctgaggaggatgaggag gaagaaacagccaggggagcgtgcattgctactttctcctctttaggaccttccaggggc aaaagctgtgctgtgggctccaactctggaagttctctggccatgacttctgaggtcctg gaagcaagtctgtccaccaaggtctaccaccaacagccaagaacaaaggaaaaagaaaag ggcatccacttcccccagaggggcagagacaggtaccttgcaaggggcttgtgcccgccc tgttcagaggagacagctgccaccagggcccaagatcaagccgcactgctgcagcccacc agagccccgtga >gi568815587r:8821319_9032308|GENSCAN_predicted_peptide_3|262_aa MGKSPGHLTKEEIQMANKHMKRCSTSYVIREMQTETTMGYQYTPIRMTKIYDTDNTKCSQ GCGATGTFFYCWWEYKMVKEEMMDNRGNSSLPDKLPIFPDSARLPLTRSFYLEPMVTFHV HPEAPVSSPYSEELPRLPFPSDSLILGNYSEPCPFSFPMPYPNYRGCEYSYGPAFTRKRN ERERQRVKCVNEGYAQLRHHLPEEYLEKRLSKVETLRAAIKYINYLQSLLYPDKAETKNN PGKVSSMIATTSHHADPMFRIV >gi568815587r:8821319_9032308|GENSCAN_predicted_CDS_3|789_bp atgggcaaaagtcctggacacctcaccaaagaggagatacagatggcaaataagcatatg aaaagatgttccacatcatatgtcatcagggaaatgcaaactgaaacaacaatgggatac caatacacacctattagaatgaccaaaatctacgacactgataacaccaaatgctctcaa ggatgtggagcaacaggaactttcttttattgctggtgggaatacaaaatggttaaagag gaaatgatggacaacagaggcaactctagtctacctgacaaacttcctatcttccctgat tctgcccgcttgccactgaccaggtccttctatctggagcccatggtcactttccacgtg cacccagaggccccggtgtcatccccttactctgaggagctgccacggctgccttttccc agcgactctcttatcctgggaaattacagtgaaccctgccccttctctttcccgatgcct tatccaaattacagagggtgcgagtactcctacgggccagccttcacccggaaaaggaat gagcgggaaaggcagcgggtgaaatgtgtcaatgaaggctacgcccagctccgccatcat ctgccagaggagtatttggagaagcgactcagcaaagtggaaaccctcagagctgcgatc aagtacattaactacctgcagtctcttctgtaccctgataaagctgagaccaagaataac cctggaaaagtttcctccatgatagcaaccaccagccaccatgctgaccctatgttcaga attgtttga >gi568815587r:8821319_9032308|GENSCAN_predicted_peptide_4|198_aa MATLWGGLLRLGSLLSLSCLALSVLLLAQLSDAAKNFEDVRCKCICPPYKENSGHIYNKN ISQKDCDCLHVVEPMPVRGPDVEAYCLRCECKYEERSSVTIKVTIIIYLSILGLLLLYMV YLTLVEPILKRRLFGHAQLIQSDDDIGDHQPFANAHDVLARSRSRANVLNKVEYAQQRWK LQVQEQRKSVFDRHVVLS >gi568815587r:8821319_9032308|GENSCAN_predicted_CDS_4|597_bp atggcgaccctgtggggaggccttcttcggcttggctccttgctcagcctgtcgtgcctg gcgctttccgtgctgctgctggcgcagctgtcagacgccgccaagaatttcgaggatgtc agatgtaaatgtatctgccctccctataaagaaaattctgggcatatttataataagaac atatctcagaaagattgtgattgccttcatgttgtggagcccatgcctgtgcgggggcct gatgtagaagcatactgtctacgctgtgaatgcaaatatgaagaaagaagctctgtcaca atcaaggttaccattataatttatctctccattttgggccttctacttctgtacatggta tatcttactctggttgagcccatactgaagaggcgcctctttggacatgcacagttgata cagagtgatgatgatattggggatcaccagccttttgcaaatgcacacgatgtgctagcc cgctcccgcagtcgagccaacgtgctgaacaaggtagaatatgcacagcagcgctggaag cttcaagtccaagagcagcgaaagtctgtctttgaccggcatgttgtcctcagctaa >gi568815587r:8821319_9032308|GENSCAN_predicted_peptide_5|621_aa XLCQPGEYSADGFAPCQLCALGTFQPEAGRTSCFPCGGGLATKHQGATSFQDCETRDRRC GGELGDFTGYIESPNYPGNYPANTECTWTINPPPKRRILIVVPEIFLPIEDDCGDYLVMR KTSSSNSVTTYETCQTYERPIAFTSRSKKLWIQFKSNEGNSARGFQVPYVTYDGQREEEK QRSLREVTEDYQELIEDIVRDGRLYASENHQEILKDKKLIKALFDVLAHPQNYFKYTAQE SREMFPRSFIRLLRSKVSSLKCERRTLLRGHEQPLEGWSHSLTVTALPETPPECHTIISL WSRGPVGGDRPEVGEIRSAPNLGGSRQSSGPGRWTLEPRLAAWRCVSEKPSSGAGGGTRG MARLSVIPGSATAWTGLLTEGGRKETDMREAASLRQQRRMKQAVQFIHKDSADLLPLDGL KKLGSSKDMRRLMETNLSKLRSGPRVPWASKTNKLNQAKSEGLKKSEEDDMILVSCQCAG KDVKALVDTGCLYNLISLACVDRLGLKEHVKSHKHEGEKLSLPRHLKVVGQIEHLVITLG SLRLDCPAAVVDDNEKNLSLGLQTLRSLKCIINLDKHRLIMGKTDKEEIPFVETVSLNED NPPITGPGDGVDDVLAPHYYF >gi568815587r:8821319_9032308|GENSCAN_predicted_CDS_5|1866_bp ngtctgtgtcaacctggtgaatattctgcagatggctttgcaccttgccagctctgtgcc ctgggcacgttccagcctgaagctggtcgaacttcctgcttcccctgtggaggaggcctt gccaccaaacatcagggagctacttcctttcaggactgtgaaaccagagacagaagatgt ggaggggagctgggagatttcactgggtacattgaatccccaaactacccaggcaattac ccagccaacaccgagtgtacgtggaccatcaacccaccccccaagcgccgcatcctgatc gtggtccctgagatcttcctgcccatagaggacgactgtggggactatctggtgatgcgg aaaacctcttcatccaattctgtgacaacatatgaaacctgccagacctacgaacgcccc atcgccttcacctccaggtcaaagaagctgtggattcagttcaagtccaatgaagggaac agcgctagagggttccaggtcccatacgtgacatatgatggtcagagagaagaagaaaag caaagaagcctcagagaagtaaccgaggactaccaggaactcattgaagacatagttcga gatggcaggctctatgcatctgagaaccatcaggaaatacttaaggataagaaacttatc aaggctctgtttgatgtcctggcccatccccagaactatttcaagtacacagcccaggag tcccgagagatgtttccaagatcgttcatccgattgctacgttccaaagtgtccagtctc aaatgtgagagacggactctgctccgtggacatgagcagccccttgaagggtggtctcat tctctaacagtaacagcactgccagaaacaccccctgaatgccacacgatcatctcactt tggtcacgtggcccggtgggtggggaccgacctgaagttggagaaatccggagcgctccc aacctcggagggagtcgccagtcctccgggcccgggcggtggaccctggagccccggctg gcggcgtggaggtgcgtttctgagaagccgagcagcggcgcgggcggcgggactcgaggc atggcccggctgtcggtgatccccgggtcggccacggcgtggacagggctcctcactgag ggcggccgcaaggagaccgacatgcgggaggcggcgtcactgcgacagcagcgccggatg aagcaggcggtgcagttcatccacaaggactccgccgacctgctgcccctggacggcctc aagaagctgggctcgtccaaggacatgaggcgcctcatggaaaccaacctgtctaagctc cgaagcggtccccgtgtcccttgggcctctaagacgaacaaactcaatcaggctaagtct gaggggctaaagaagtctgaggaggatgacatgattttggtttcttgccagtgtgctgga aaggatgtgaaagccttggttgacacaggctgcctatataatctcatctctttggcctgt gtggacagattgggactcaaggagcatgtcaaatcccacaagcatgaaggagaaaagctt tctctaccccggcatctcaaagtagtgggccagattgagcacctagtgatcacactgggc tccctccgcctggactgcccagcagctgtggttgatgacaatgagaaaaacttgtccctt ggtctacagactctccgatctctgaagtgcatcataaacttggataagcaccggctgatc atggggaagacagacaaggaagaaatcccttttgtggagacagtctctttgaatgaagac aatcctcctattactggtcctggagatggggtagatgatgttcttgctcctcattactac ttctag