GENSCAN 1.0    Date run: 5-Nov-116    Time: 17:39:09

Sequence gi568815596r:210193723_210415042 : 221320 bp : 37.70% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.15 PlyA -    174    169    6                               1.05
 1.14 Term -   1616   1467  150  2  0   47   40    82 0.144  -3.77
 1.13 Intr -   9722   9609  114  2  0   88  115    93 0.999  11.72
 1.12 Intr -  10960  10859  102  1  0   59   71   111 0.984   5.95
 1.11 Intr -  12074  11910  165  2  0   74  109   118 0.999  11.74
 1.10 Intr -  16540  16474   67  0  1   90   93    55 0.995   4.19
 1.09 Intr -  21431  21363   69  1  0   16   84   131 0.879   2.88
 1.08 Intr -  22789  22625  165  0  0   72  115   131 0.999  12.35
 1.07 Intr -  24380  24243  138  1  0   68   92   113 0.997   8.36
 1.06 Intr -  27080  26925  156  1  0   31   89   185 0.999  11.10
 1.05 Intr -  31327  31113  215  1  2   23   20   181 0.223   1.39
 1.04 Intr -  31885  31465  421  0  1   51  101   295 0.292  20.12
 1.03 Intr -  39843  39802   42  2  0  119   90    50 0.306   4.74
 1.02 Intr -  56995  56929   67  2  1   89  115    37 0.064   3.54
 1.01 Init -  78442  78433   10  1  1   91   93     4 0.095   1.95
 1.00 Prom -  81004  80965   40                              -4.05

 2.02 PlyA -  81484  81479    6                               1.05
 2.01 Sngl -  88422  88045  378  0  0   84   31   218 0.980  11.61
 2.00 Prom -  89021  88982   40                              -8.35

 3.07 PlyA -  89374  89369    6                               1.05
 3.06 Term -  90943  90822  122  2  2   50   49    79 0.082  -2.14
 3.05 Intr - 100078 100001   78  2  0   29   70   138 0.791   4.70
 3.04 Intr - 100696 100523  174  2  0   57   82   194 0.919  14.69
 3.03 Intr - 104841 104698  144  1  0   72   86    75 0.950   5.03
 3.02 Intr - 109070 109046   25  2  1   78  119    44 0.527   3.18
 3.01 Init - 121320 121189  132  0  0   61   88   127 0.663  10.48
 3.00 Prom - 124910 124871   40                               0.25

 4.00 Prom + 129728 129767   40                              -5.25
 4.01 Init + 130874 131112  239  1  2   53   95   113 0.250   6.03
 4.02 Term + 143742 143868  127  0  1   60   48   120 0.054   1.97
 4.03 PlyA + 146144 146149    6                               1.05

 5.00 Prom + 146168 146207   40                              -6.55
 5.01 Sngl + 157929 158258  330  2  0   88   37   314 0.998  22.07
 5.02 PlyA + 158342 158347    6                               1.05

 6.00 Prom + 159338 159377   40                              -6.15
 6.01 Sngl + 159431 160654 1224  1  0   49   41   497 0.253  36.65
 6.02 PlyA + 160741 160746    6                               1.05

 7.05 PlyA - 162593 162588    6                               1.05
 7.04 Term - 166709 166521  189  2  0   11   41   169 0.297   0.97
 7.03 Intr - 171620 171559   62  0  2   90  100    48 0.062   3.73
 7.02 Intr - 176793 176578  216  1  0   89   82    74 0.003   4.45
 7.01 Init - 184450 184339  112  1  1   82   53   121 0.062   8.43
 7.00 Prom - 185512 185473   40                              -7.85

 8.00 Prom + 186018 186057   40                              -6.25
 8.01 Init + 187676 187811  136  1  1   53   88   106 0.815   7.45
 8.02 Intr + 188184 188575  392  1  2   75   50   180 0.156   6.42
 8.03 Intr + 195566 195629   64  1  1   69   96    36 0.214  -0.13
 8.04 Intr + 196517 196651  135  0  0   95   98    75 0.237   8.82
 8.05 Term + 202776 202855   80  1  2   79   43    45 0.215  -3.95
 8.06 PlyA + 203020 203025    6                               1.05

 9.05 PlyA - 203147 203142    6                               1.05
 9.04 Term - 203981 203905   77  0  2   56   35   107 0.133  -0.78
 9.03 Intr - 206363 206230  134  1  2   86   45    64 0.070   1.27
 9.02 Intr - 207863 207675  189  1  0   35   78   103 0.088   1.78
 9.01 Init - 212359 212244  116  1  2   58   50   131 0.133   6.03
 9.00 Prom - 219545 219506   40                              -1.75

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl - 176905 176495  411  1  0   66   49   220 0.914  11.94

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:210193723_210415042|GENSCAN_predicted_peptide_1|626_aa
MQNSGIKEVVNYGAGFSLQAHRLLRRHFFIRFSIYLVNPGYLRVFGGSIAFIRYSKRFRT
LKRLRAASINKKECCKQRQQFRCKILLLTAKTGVFPRDQPVAWLGLRKGARERFFGRTPQ
VDASADRPPSRGPCPESPSSRRVASVAPPPLCFGHGRTPSPRVPTRPGRPPCAAPAARRA
SWPPFLDPADWVQPASGRRKKGCQPCRPENRPPSPRRGKPKDFSRSSFRSPNPTRLVQRP
AVGPGSLRFGRRCSHSGGEERLETPSAKKLTDIGIRRIFSPEHDIFRKSVRKFFQEEVIP
HHSEWEKAGEVSREVWEKAGKQGLLGVNIAEHLGGIGGDLYSAAIVWEEQAYSNCSGPGF
SIHSGIVMSYITNHGSEEQIKHFIPQMTAGKCIGAIAMTEPGAGRQQQLVGQQKANSAKK
ELKDAKEVDLQGIKTNAKKDGSDWILNGSKVFISNGSLSDVVIVVAVTNHEAPSPAHGIS
LFLVENGMKGFIKGRKLHKMGLKAQDTAELFFEDIRLPASALLGEENKGFYYIMKELPQE
RLLIADVAISASEFMFEETRNYVKQRKAFGKTVAHLQTVQHKLAELKTHICVTRAFVDNC
LQLHEAKRLDSATACMAKYWYACYSN

>gi568815596r:210193723_210415042|GENSCAN_predicted_CDS_1|1881_bp
atgcagaactctggtataaaagaagttgtcaattatggtgctggtttttctctacaagct
cacagacttctgagaagacacttttttattcgatttagcatctacctggttaatccaggg
tatttgcgcgtatttgggggctccatagctttcatccgttactcaaagcgcttcaggacg
ctaaaaaggctcagagctgcttcgataaacaagaaagaatgctgcaagcaaaggcagcag
tttcgctgtaaaatcctattgctgacagccaagacgggcgtattccctcgcgaccagcct
gtggcgtggttggggctccggaagggcgcgcgcgagcgcttttttgggaggacaccacag
gtggacgcctcagctgatcgtcctccctcccggggaccctgccccgagtcgccgagtagc
cgcagagtcgcctccgtcgccccgccgcccctgtgtttcggacatggccgcacgccttct
ccgagggtccctacgcgtcctgggcggccaccgtgcgccgcgccagctgcccgccgcgcg
agctggcctcctttcctggatcccgcggactgggtgcaaccagcatctgggcggaggaaa
aaggggtgtcagccgtgccgccccgagaacagaccgccgagcccacgtcggggaaagccg
aaagatttctccagaagttcattccggagccccaatccaacccgtctagttcagcggcct
gcagtgggacccgggagcctgcgttttgggaggagatgttctcattccggaggggaagaa
cgtctagaaactccttctgctaaaaaattaacagatataggaattcgaagaatcttttct
ccagagcatgacattttccggaaaagtgtaaggaagtttttccaagaagaagtgattcct
catcactcagaatgggagaaagctggagaagtaagtagggaggtttgggaaaaagctgga
aaacaaggactgcttggtgtcaatattgcagagcatcttggtggaattggaggggatctg
tactccgcagctattgtctgggaggagcaagcttattcaaattgttcaggcccaggtttt
agtattcattcaggtattgtcatgtcctatattacaaaccatggctcagaagaacagatt
aagcactttattccccagatgactgcaggcaaatgtattggtgcaatagcaatgacagag
cctggagctggaagacagcagcaacttgtgggtcaacagaaagcaaatagtgcaaagaaa
gaacttaaagatgccaaagaagttgacttacagggaataaaaacaaatgctaaaaaggat
ggaagtgactggattctcaatggaagcaaggtgttcatcagtaatgggtcattaagtgat
gttgtgattgtagttgcggtcacaaatcatgaagctccctcccctgcccatggtattagc
ctttttctggtggaaaatggaatgaaaggatttatcaagggacgaaagctacataaaatg
ggattaaaagcccaggataccgcagaactattctttgaagatatacggttgccagctagt
gccctacttggagaagagaataaaggcttctattacatcatgaaagagcttccacaggaa
aggctgttaattgctgatgtggcaatttcagctagtgaattcatgtttgaagaaaccagg
aactatgttaaacaaagaaaagcttttggcaaaacagttgctcacctacagacagtgcaa
cataaattagcagaattaaaaacacatatatgtgtaacccgagcatttgtggacaactgt
ctccagctgcatgaagcgaaacgtttggactccgccactgcttgcatggcgaaatattgg
tatgcatgctacagtaattag

>gi568815596r:210193723_210415042|GENSCAN_predicted_peptide_2|125_aa
MESIIKSLPSKKSPGPNHFHAEFYQKFKDELSPILRKFYQKTEEERILPNSFYDVGIALI
PKPDKDTRKENYRSISLMNMDPKILKMLANQIHQHIKKIIHHDQVEQIHLKIYTISIDAE
KKHLI

>gi568815596r:210193723_210415042|GENSCAN_predicted_CDS_2|378_bp
atggaatctataataaaaagtcttccatcaaagaaaagcccaggacctaatcacttccat
gctgaattctatcaaaaatttaaagacgaactaagtccaattcttcgcaaattctaccaa
aaaactgaagaggagagaattcttccaaactcattctatgatgtcggcattgccctgata
ccaaaaccagacaaggacacaagaaaagaaaactatagatcaatatccctgatgaacatg
gatccaaaaattctcaaaatgctagcaaaccaaattcaccaacacattaaaaagattatt
caccatgatcaagtggaacagatccacttgaaaatatataccatttcaatagatgcagaa
aaaaagcatttgatctaa

>gi568815596r:210193723_210415042|GENSCAN_predicted_peptide_3|224_aa
MAPKKDVKKPVAAAAAAPAPAPAPAPAPAPAKPKEEKIDLSAIKSFSADQIAEFKEAFLL
FDRTGDSKITLSQVGDVLRALGTNPTNAEVRKVLGNPSNEELNAKKIEFEQFLPMMQAIS
NNKDQATYEDFVEGLRVFDKEGNGTVMGAELRHVLATLGEKMKEEEVEALMAGQEDSNGC
INYEATLKTSIWNLVSLQLPVGCVRPDTKMGLLSYVSPLWRKLE

>gi568815596r:210193723_210415042|GENSCAN_predicted_CDS_3|675_bp
atggcaccaaagaaagacgtgaagaaacctgtggctgcggctgcggctgccccagccccg
gcaccggcacctgcacctgcccctgccccagccaaacccaaagaagaaaaaattgacctc
tctgccattaagtccttcagtgctgaccagattgctgaattcaaggaggcatttctcctg
tttgacagaacaggtgattccaagatcaccttaagccaggtcggtgatgtccttcgagct
ctgggcacaaatcccaccaatgcagaggtcaggaaagttctgggaaaccccagcaatgaa
gagctgaatgccaagaaaattgagtttgaacaatttctgcctatgatgcaagccatttcc
aacaacaaggaccaggccacctatgaagactttgttgagggtctgcgtgtctttgacaag
gaaggcaatggcacagtcatgggtgctgaactccgccatgttctagccaccctgggtgaa
aagatgaaagaggaagaagtggaagccctgatggcaggtcaagaagactccaatggctgc
atcaactacgaagccactctgaaaaccagtatctggaacctggtgtcactacaactgcct
gtaggctgtgtcagacctgacaccaagatgggtctcctcagttatgtctccccactgtgg
agaaaattagaatag

>gi568815596r:210193723_210415042|GENSCAN_predicted_peptide_4|121_aa
MADSAFASTCPGQGFLIRAQESIHVSRRPPYGTLGILERLAASVLVFGAVLLEEETAQRA
LLCCCKPVPGFLKCETPVTLGEAAGEAGEAGEVASFIPSPLGPNRRSPATFTAVHGERSK
G

>gi568815596r:210193723_210415042|GENSCAN_predicted_CDS_4|366_bp
atggctgacagtgcatttgcttctacttgtcctggtcaaggttttctaattcgggcccaa
gagagtattcatgtctccaggaggccaccctatggcacgttaggcattcttgagagactg
gcggcctcagtccttgtgtttggagcagttctgctagaggaggaaacagcacaaagggca
ctcttatgctgctgtaaaccagtgccaggatttttaaaatgtgagacacctgttacccta
ggtgaagctgctggagaagctggagaagctggagaagtggcctctttcatcccatcacct
ctaggacccaataggaggagccctgccacattcacagcagtacatggggagaggagcaag
ggttga

>gi568815596r:210193723_210415042|GENSCAN_predicted_peptide_5|109_aa
MGKKQSRKTGNSKNQSATPPPKEHSSSPATEQSWTENDFDKLREEGFRRSNYSELKEEVR
THGKEVKNLEKKLDEWLTRITNAEKSLKYLMELKTKAQELHDKCTSFSS

>gi568815596r:210193723_210415042|GENSCAN_predicted_CDS_5|330_bp
atggggaaaaaacagagcagaaaaactggaaactctaaaaatcagagtgccactcctcct
ccaaaggaacacagctcctcaccagcaacagaacaaagctggacagagaatgactttgac
aagttgagagaagaaggcttcagacgatcaaactactctgagctaaaggaggaagttcga
acccatggcaaagaagttaaaaaccttgaaaaaaaattagacgaatggctaactagaata
accaatgcagagaagtccttaaagtacctgatggagctgaaaaccaaggcacaagaacta
catgacaaatgcacaagcttcagtagctga

>gi568815596r:210193723_210415042|GENSCAN_predicted_peptide_6|407_aa
MGGFNTPLSTLDISMRQKVNKDIQEVNSALDQVDLIDIYRTLHPKSTEHTFFSAPHHTYS
KIHHIVGSKVLLSKRKRTEIITNCFSDHSAIKLELRIKKLTQNRSTTWKLNNLLLNDYWV
HNEMKAEIKMFFETNENKDTTYQNLWDTFKAVCRGKFIALNTHKRKQERSKIDTLTSQLK
ELEKQEQTRSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFEKVNKIDRLIARLIK
KKTEKNQIDAIKNDKGDITTNPTEIQTTIREYYKHLYGNKLENLEEMDKFLNTYTLPRLN
QEEVESLNRPITGSEIEAIINSLPTKKSPGPDGFTAEFYQRYKEVLVPFLLKLFQSIEKE
GLLPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILSKILAN

>gi568815596r:210193723_210415042|GENSCAN_predicted_CDS_6|1224_bp
atgggaggttttaacaccccactgtcaacattagacatatcaatgagacagaaagttaac
aaggatatccaggaagtaaactcagctctcgaccaagtggacctaatagacatctacaga
actctccaccccaaatcaacagaacatacattcttctcagcaccacaccacacctattcc
aaaattcaccacatagttggaagtaaagtactgctcagcaaacgtaaaagaacagaaatt
ataacaaactgtttctcagaccacagtgcaatcaaactagaactcaggattaagaaactc
actcaaaaccgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta
cataacgaaatgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacaca
acataccagaatctctgggacacattcaaagcagtgtgtagagggaaatttatagcacta
aatacccacaagagaaagcaggaaagatctaaaattgacaccctaacatcacaattaaaa
gaactagagaagcaagagcaaacacgttcaaaagctagcagaaggcaagaaataactaag
atcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaatcaatgagtcc
aggagctggttttttgaaaaggtcaacaaaattgatagactgatagcaagactaataaag
aagaaaacagagaagaatcaaatagatgcaataaaaaatgacaaaggggatatcaccacc
aatcccacagaaatacaaactaccatcagagaatactataaacacctctatggaaataaa
ctagaaaatctagaagaaatggataaattcctcaacacatacaccctcccaagactaaac
caggaagaagttgaatctctgaataggccaataacaggctctgaaattgaggcaataatt
aatagcttaccaaccaaaaaaagtccaggaccagatggattcacagccgaattctaccag
aggtacaaggaggtgctggtacctttccttctgaaactattccaatcaatagaaaaagag
ggactcctccctaactcattttatgaagccagcatcatcctgataccaaagcctggcaga
gacacaaccaaaaaagagaattttagaccaatatccttgatgaacatcgatgcaaaaatc
ctcagtaaaatactggcaaactga

>gi568815596r:210193723_210415042|GENSCAN_predicted_peptide_7|192_aa
MGPAVAGMVLLEAPGRCDQSGTGVSEAHVAEGYLSLRAGTYLHMPSMGMETGLPSPFQPP
LTTHIALGIQSGIIMCAMPATAIMHAMPANQWTENTPTCPDHCSYYCHPTPRNKRNKHSF
LTQEGGFQVKKATSIGKGGKYYIKGTPRETKECEQQPSVLDLPSDRGYTNEKEPENQLWY
KVGLYQKQNKAL

>gi568815596r:210193723_210415042|GENSCAN_predicted_CDS_7|579_bp
atggggcctgctgtggctggcatggtgctactagaagccccaggtagatgtgatcagagt
ggcactggtgtatcagaagcccatgttgctgagggttacctgtcactgagagctggcacc
tacctgcacatgccatctatgggcatggagactggtctgcccagtccatttcagccacca
ttaacaacgcacatagcacttgggatccagagtggtatcatcatgtgcgccatgccagct
actgccatcatgcatgccatgccagctaaccagtggacggagaacacacccacctgccca
gaccactgcagttactattgtcatccaacaccgaggaacaagaggaacaagcattcgttt
ctaacacaggaagggggctttcaggtcaagaaagccacatccataggaaaagggggaaag
tactacatcaaggggacaccccgtgagacaaaagaatgtgaacaacagccttcagtccta
gaccttccctctgacagaggctacacaaatgagaaggaaccagaaaaccaactctggtac
aaagttggtttgtaccaaaaacaaaacaaggctctttaa

>gi568815596r:210193723_210415042|GENSCAN_predicted_peptide_8|268_aa
MWKSLEVPRDLLNGFAQNADSNMDNKVQTEVVSGRNEEFAGNWSKGLGGKNGFMPGPGSP
CCVQSREFLVPCVPVASTVTKRDQGTTQAVTSEGASPKAWHLSRGVEPAGSQKSRIEVWE
RLPRFQRMYGNTWLSRQKFAAGAGLSWRTSARAVQKGNVGSKPPHRVPTGALPSGARTRS
AKAHLPKESLESSLSSSVLAPFPFDFLPIFFGVMARHYQSLLLQALQEPMDSKGLSPNVK
HDNYYSGSPLSYSSNLRRLVIPLGPNST

>gi568815596r:210193723_210415042|GENSCAN_predicted_CDS_8|807_bp
atgtggaaaagtttggaagttcctagagacttgttgaatggctttgctcaaaatgcagac
agcaatatggacaataaggtccagactgaggtagtctcaggtagaaatgaggaatttgct
gggaactggagcaaaggcctaggagggaaaaatggtttcatgccaggcccagggtccccg
tgctgtgtgcagtctagagaatttttggtgccctgtgtcccagttgcttcaacagtgact
aaaagggaccaaggtacaactcaggccgtaacttcagagggtgcaagccccaaggcttgg
catctttcacggggtgttgagcctgcaggttcacagaagtcaagaattgaggtttgggaa
cgtctgcctagatttcagaggatgtatggaaacacctggctgtccaggcagaagtttgct
gcaggggcagggctctcatggagaacctctgctagggcagtacagaagggaaatgtaggg
tcaaagcccccacacagagtccctactggggcacttcctagtggagctcgtacaagatct
gcaaaagcccatctgcccaaagaatccttggagagttctctgagctcctcagttttagct
cctttcccattcgatttcttacctatcttctttggagtcatggcaagacactaccagtcc
ctgctgctacaggctttgcaggagcctatggattcaaagggcctgagcccaaatgtaaaa
catgacaattactattctggctcccctctttcctattcttccaacttgagaaggcttgtt
attcctctggggcccaatagtacttga

>gi568815596r:210193723_210415042|GENSCAN_predicted_peptide_9|171_aa
MELKNTARELCKAYTSINSQINQEEERISEIEDELNDIKDMDEAGNYHSQQTNTRTENQT
AHVLTHNWELNNENIWTQRGEHHRGHREGNNENIWTPGPVKRQNDPALVCSLHKDVLGTM
GTGPSTVMLTEGSLERTLAKISVYIYMEVLHLSGVMDHFKIPLKDMDPLSE

>gi568815596r:210193723_210415042|GENSCAN_predicted_CDS_9|516_bp
atggagctgaaaaacacagcacgagaactttgtaaagcatacacaagtatcaatagccaa
atcaatcaagaggaagaaaggatatcagagattgaagatgaacttaatgatataaaggac
atggatgaagctggaaactatcattctcagcaaactaacacaagaacagaaaaccaaaca
gcacatgttctcactcataattgggagctgaacaatgagaacatatggacacagagaggg
gagcatcaccggggacacagagaggggaacaatgagaacatatggacaccggggcctgtc
aagagacaaaatgatcctgctctggtttgttccctgcacaaagatgttctgggaacaatg
ggtacaggaccatccactgtgatgctgacagaaggcagcttggagagaacactcgcgaaa
atatctgtttatatatacatggaggttcttcacctttctggtgtcatggaccacttcaag
atccccttaaaagatatggatccactctcagaataa