GENSCAN 1.0 Date run: 3-Nov-116 Time: 04:14:29 Sequence gi568815597r:84547436_84771219 : 223784 bp : 39.83% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1351 1529 179 2 2 101 62 154 0.948 12.82 1.02 Intr + 2997 3095 99 2 0 72 86 43 0.750 1.89 1.03 Intr + 8475 8544 70 2 1 72 90 67 0.030 3.14 1.04 Term + 14233 14336 104 2 2 53 44 102 0.010 -0.24 1.05 PlyA + 14371 14376 6 1.05 2.19 PlyA - 14505 14500 6 1.05 2.18 Term - 14961 14947 15 0 0 99 43 1 0.352 -6.04 2.17 Intr - 15983 15810 174 2 0 93 88 116 0.643 11.31 2.16 Intr - 16397 16300 98 0 2 59 75 79 0.999 2.51 2.15 Intr - 18577 18406 172 1 1 61 99 113 0.694 8.29 2.14 Intr - 22704 22496 209 1 2 65 98 121 0.999 8.67 2.13 Intr - 23285 23147 139 2 1 94 95 134 0.999 13.82 2.12 Intr - 27018 26804 215 1 2 41 60 341 0.092 24.11 2.11 Intr - 27319 27253 67 1 1 54 66 80 0.051 -0.04 2.10 Intr - 34923 34843 81 0 0 83 74 81 0.119 5.12 2.09 Intr - 36445 36292 154 0 1 9 86 143 0.629 5.35 2.08 Intr - 41962 41926 37 2 1 32 115 38 0.088 -2.70 2.07 Intr - 49100 48996 105 0 0 82 103 100 0.259 10.17 2.06 Intr - 50661 50514 148 0 1 94 94 121 0.977 12.19 2.05 Intr - 51847 51748 100 0 1 81 48 81 0.783 2.59 2.04 Intr - 58560 58327 234 2 0 73 47 163 0.875 6.58 2.03 Intr - 59631 59606 26 0 2 122 86 28 0.712 1.91 2.02 Intr - 59829 59793 37 2 1 109 85 22 0.228 1.35 2.01 Init - 67075 67005 71 1 2 82 74 61 0.138 4.67 2.00 Prom - 68644 68605 40 -5.75 3.00 Prom + 68790 68829 40 -5.25 3.01 Init + 72459 72649 191 2 2 63 99 90 0.382 6.13 3.02 Intr + 78250 78346 97 1 1 47 98 78 0.317 3.79 3.03 Term + 84775 84891 117 0 0 54 53 137 0.427 4.36 3.04 PlyA + 85187 85192 6 1.05 4.12 PlyA - 87192 87187 6 1.05 4.11 Term - 100172 99998 175 1 1 75 42 89 0.640 -0.75 4.10 Intr - 103092 102927 166 1 1 97 36 100 0.739 3.80 4.09 Intr - 104562 104448 115 0 1 41 81 102 0.982 3.90 4.08 Intr - 108570 108397 174 0 0 24 53 228 0.712 12.11 4.07 Intr - 109049 108913 137 0 2 49 68 171 0.866 10.57 4.06 Intr - 111033 110883 151 0 1 100 86 135 0.893 13.31 4.05 Intr - 114940 114763 178 0 1 72 93 129 0.886 10.80 4.04 Intr - 115095 115020 76 1 1 56 103 54 0.781 1.35 4.03 Intr - 117117 116982 136 0 1 80 65 115 0.784 7.62 4.02 Intr - 122458 122246 213 1 0 74 70 110 0.861 5.79 4.01 Init - 123342 123211 132 0 0 65 92 4 0.141 -1.31 4.00 Prom - 124580 124541 40 -6.65 5.00 Prom + 126981 127020 40 -3.55 5.01 Init + 134236 134553 318 0 0 39 54 195 0.783 8.58 5.02 Intr + 142632 142826 195 2 0 58 55 150 0.955 7.39 5.03 Intr + 143380 143568 189 0 0 87 37 91 0.711 2.76 5.04 Term + 146835 147191 357 2 0 9 44 337 0.247 14.93 5.05 PlyA + 148941 148946 6 1.05 6.00 Prom + 159424 159463 40 -5.55 6.01 Init + 159821 159897 77 1 2 92 70 15 0.030 0.71 6.02 Term + 171385 171550 166 1 1 85 53 125 0.615 5.11 6.03 PlyA + 173576 173581 6 1.05 7.03 PlyA - 174386 174381 6 1.05 7.02 Term - 177322 177162 161 2 2 53 42 156 0.916 4.62 7.01 Init - 178676 178604 73 2 1 83 67 86 0.461 7.28 7.00 Prom - 187504 187465 40 -4.85 8.10 PlyA - 189868 189863 6 1.05 8.09 Term - 191033 190858 176 0 2 27 49 156 0.654 2.54 8.08 Intr - 197735 197617 119 1 2 -6 94 93 0.035 -0.31 8.07 Intr - 201795 201629 167 0 2 68 29 118 0.202 1.74 8.06 Intr - 202784 202719 66 2 0 106 71 58 0.781 4.08 8.05 Intr - 205068 204940 129 0 0 105 90 46 0.981 6.47 8.04 Intr - 205939 205822 118 0 1 -76 91 196 0.462 3.05 8.03 Intr - 207002 206975 28 0 1 63 80 42 0.250 -2.84 8.02 Intr - 212178 211880 299 2 2 21 59 180 0.014 3.99 8.01 Init - 217992 217877 116 0 2 89 80 103 0.418 9.33 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 8475 8582 108 2 0 72 42 106 0.864 1.93 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:84547436_84771219|GENSCAN_predicted_peptide_1|150_aa XYNGWKKKYLETKKVTASMEEVLTKLREDLELYYKKLLMQLEAREIKMRPKNLANITDSK NYLIIQITEVQHAIDQLKRKLDTDKMKLIVEVKLLEVCLARRSTEDFVYDESAKSKEIAI ATPTFSNHHPDQSAAINDKARHSSSKQVIT >gi568815597r:84547436_84771219|GENSCAN_predicted_CDS_1|453_bp ncctacaatggttggaagaaaaaatacttggaaacaaagaaagtcacagcatcaatggag gaggttttaacaaaacttcgagaagatttggaactctactataaaaaactgctcatgcaa cttgaagccagggagatcaagatgagaccaaagaatctggcaaacatcacagactccaag aattacctaataatccagatcactgaggtacagcatgcaattgaccagcttaagagaaaa ctagatactgacaaaatgaaactcatagtagaagttaagttgctagaagtgtgcctggct aggaggtctacagaagattttgtttatgatgaatcagccaagtcaaaagaaattgccata gccaccccaaccttcagcaaccaccaccctgatcagtcagcagccatcaatgacaaggca agacactcttccagcaagcaggttataacttga >gi568815597r:84547436_84771219|GENSCAN_predicted_peptide_2|693_aa MMATILDYEHEDHTQGKERQQAGRPWPMGEMSTQTQGMLHGDDPRGRSEVWAVGSFGDGG AGGADGAAGGGPGQRGLEGLGHLEAASNGSSKTCSSDLGELLSTDLSDMIAKRAELVLQC LQRVNTGGDSPPSLQGELSLYSIPKGSPERSCKVELLAVLEFAHQICHPRRPKVQQDDCE QDQGPQPLQKKVEDYVHERHSRLSSAFCHVRMWHKGAIWEQKLDPHQTPDLPEFKSIRGH GLFDKVQLFILAREGPAARKYRLVHIRPDHREMVKVRQRNSDFTQEFLLVLKQENEKWPQ GIRKDPLVLMNGVIHKPEDLVETSVLTHHIPTPQGVHDQPRQNEMQTIGRTINGGRNPLR PARPAAMSRPQLRRWRLVSSPPSGVPGLALLALLALLALRLAAGTDCPCPEPELCRPIRH HPDFEVFVFDVGQKTWKSYDWSQITTVATFGKYDSELMCYAHSKGARVVLKGDVSLKDII DPAFRASWIAQKLNLAKTQYMDGINIDIEQEVNCLSPEYDALTALVKETTDSFHREIEGS QVTFDVAWSPKNIDRRCYNYTGIADACDFLFVMSYDEQSQIWSECIAAANAPYNQTLTGY NDYIKMSINPKKLVMGVPWYGYDYTCLNLSEDHVCTIAKVPFRGAPCSDAAGRQVPYKTI MKQINSSISGNLWDKDQRAPYYNYKVRLFRALV >gi568815597r:84547436_84771219|GENSCAN_predicted_CDS_2|2082_bp atgatggctaccatcttggactatgaacatgaggatcacacccaagggaaggagaggcag caagctggaagaccttggccgatgggtgaaatgagtactcagacacagggaatgttgcat ggggatgatccaaggggtaggagtgaagtctgggcagtggggagctttggagatggtgga gctggaggagctgatggtgcagctggagggggccctggacaaaggggactggaaggactg gggcacttagaagctgcctctaatggttcctccaaaacttgctcttctgacttaggggaa ctactctccacagacctgtctgatatgattgctaagagggctgagttggttttgcaatgc ttgcaaagagttaacactggaggagactcgcccccaagtttgcagggagagctaagtcta tatagcattcctaaagggagcccagagaggtcctgtaaggtggaactgttggcagttttg gagtttgcccaccagatatgccatccaagaagaccaaaagtacagcaggatgattgtgaa caagaccagggacctcagcccttgcaaaaaaaggttgaagactatgtacatgaacgacat tctaggctatcctcagccttctgccatgtgaggatgtggcacaaaggtgccatttgggag cagaaactggaccctcaccagacaccagacctgccagagtttaagagcattagaggccac ggtttatttgataaagtccagttgttcatccttgcaagagaagggcctgcagcgcggaaa tacagacttgtccacattaggccagatcacagagaaatggtaaaagtccggcagaggaat tcggacttcactcaggagttcctgctggttctgaagcaggaaaatgagaagtggcctcag ggaattagaaaggatccactggtgttgatgaatggagtaatccacaagccggaggatctg gtggaaacatcggtgctcacacatcacattccgactccacaaggagtacacgaccagcca cgacaaaacgagatgcagactattgggaggactataaacggcggtaggaacccactccgg cccgctagacctgctgctatgtcccggccgcagcttcgacgctggcgcctcgtctctagc ccgccgagcggcgtcccgggtctagcgctgctggcgctgctggcgctgctggcgctgcgg ctcgcggccgggaccgactgcccatgcccggagcctgagctctgccgcccgattcgccac catccagatttcgaggtctttgtgtttgatgttggacagaaaacttggaaatcttatgat tggtcacagattacaactgtggcaacatttggaaaatatgactcagaacttatgtgctac gctcattcaaaaggagccagagtagtacttaaaggagatgtatccttaaaggatatcatt gatcctgctttcagagcatcctggatagctcaaaaacttaatttggccaaaacacaatat atggatggaattaatatagatatagagcaagaagttaattgtttatcacctgaatatgat gcattaactgctttagtcaaagaaactacagactctttccatcgtgaaattgagggatca caggtaacctttgatgtagcttggtctccaaagaacatagacagaagatgctataattat actggaatcgcagatgcttgtgacttcctctttgtgatgtcttatgatgaacaaagtcag atctggtcagaatgtattgcagcagccaatgctccctataatcagacattaactggatat aatgactacatcaagatgagcattaatcctaagaaacttgtaatgggtgttccttggtat ggttatgattatacctgcctgaatctgtctgaggatcatgtttgtaccattgcaaaagtc cctttccggggggctccttgtagtgacgctgcaggacgtcaggtgccctacaaaacgatc atgaagcaaataaatagttctatttctggaaacctatgggataaagatcagcgggctcct tattataactataaagtaagacttttcagagctttagtgtag >gi568815597r:84547436_84771219|GENSCAN_predicted_peptide_3|134_aa MSNLRPRKTTQEALSKWSRGSRMCLVLYISGGQELQAKTEINTGHIYISSAEKAGYLEVG GLQRNSRRNALDNSFQTSMRRISCDFNAEVINFNLQVTHDHYHRPLRTIMSKTVDFGRQQ KLVLDITAAQHYRL >gi568815597r:84547436_84771219|GENSCAN_predicted_CDS_3|405_bp atgagtaacctccgcccccggaaaacaacccaagaagccttgagtaagtggtcccgaggc agccgcatgtgtttggttttatacatttcaggagggcaggagttacaggcgaagacagaa atcaatacagggcacatatacattagttcagcagaaaaggcaggctatcttgaagtaggg ggcttacagaggaattccaggagaaacgctttagacaactccttccaaacctcaatgaga agaatatcctgtgacttcaatgcggaagtgatcaatttcaacctgcaggttacacatgac cattatcaccgtcctctaaggaccataatgtcaaaaacagtggacttcggccgccagcag aaacttgttctagatataacagctgcccagcactaccgattatga >gi568815597r:84547436_84771219|GENSCAN_predicted_peptide_4|550_aa MSPSSLYSQQVLCSSIPLSKNVHSFFSAFCTEDNIEQSISYLDQELTTFGFPSLYEESKG KETKRELNIVAVLNCMNELLVLQRKNLLAQENVETQNLKLGSDMDHLQSCYSKLKVQKLQ NIIASRATQYNHDMKRKEREYNKLKERLHQLVMNKKDKKIAMDILNYVGRADGKRGSWRT GKTEARNEDEMYKILLNDYEYRQKQILMENAELKKVLQQMKKEMISLLSPQKKKPRERVD DSTGTVISDVEEDAGELSRESMWDLSCETVREQLTNSIRKQWRILKSHVEKLDNQVSKVH LEGFNDEDVISRQDHEQETEKLELEIQQCKEMIKTQQQLLQQQLATAYDDDTTSLLRDCY LLEEKERLKEEWSLFKEQKKNFERERRSFTEAAIRLGLERKAFEEERASWLKQQFLNMTT FDHQNSENVKLFSAFSGSSDWDNLIVHSRQPQKKPHSVSNGSPVCMSKLTKSLPASPSTS DFCQTRSCISEHSSINVLNITAEEIKPNQVGGECTNQKWSVASRPGSQEGCYSGCSLSYT NSHVEKDDLP >gi568815597r:84547436_84771219|GENSCAN_predicted_CDS_4|1653_bp atgtctccatcaagtttatactcacagcaagtgctatgttcttcaatacctttatcgaaa aatgtgcacagttttttcagtgccttctgcacagaagataatattgaacagagtatctca tatcttgatcaggaattgactacttttggttttccttcattatatgaagaatccaaaggt aaagagacaaagagagagttaaatatagtagctgtactaaattgtatgaatgagctgctt gtgcttcagcggaagaaccttctagctcaggaaaatgtggagacacagaatttgaagctg ggaagtgatatggaccatctacagagctgctactcaaaacttaaggtgcaaaaattacaa aatatcattgcaagtcgagctactcagtataatcatgatatgaagagaaaagagcgtgaa tataataaactgaaggaacgtctacatcaacttgttatgaacaagaaagataagaaaata gctatggacattttgaattatgtcgggagagctgatggaaaaagaggctcctggaggact ggtaaaactgaagccaggaatgaagatgaaatgtataaaattctcttgaatgattatgaa tatcgtcagaaacaaatcctaatggaaaatgcagaacttaagaaggttcttcaacaaatg aaaaaggaaatgatttctcttctttctccccaaaagaagaaacctagagaaagagtagat gatagtacaggaactgttatttccgatgttgaagaagatgccggggaactaagcagagag agtatgtgggacctttcctgtgaaactgtgagagagcagcttacaaacagcatcagaaaa cagtggagaattttgaaaagtcatgtagaaaagcttgataaccaagtttcaaaggtacac ctggaaggttttaatgatgaagatgtaatctcacgacaagaccatgaacaagaaactgaa aaactcgagttagaaattcagcagtgtaaagaaatgattaaaactcagcaacagctttta cagcagcagctcgctactgcatatgatgatgataccacttcactattacgagactgttat ttgttggaagaaaaggaacgtctcaaagaagaatggtccctttttaaagagcagaaaaag aattttgagagggagagacgaagctttacagaagccgctattcgcctgggattggagaga aaggcatttgaagaagaaagagccagttggttaaagcagcagtttctaaatatgactacc tttgaccaccagaactcagaaaatgtgaaacttttcagtgccttctcaggaagttctgat tgggacaatcttatagtgcactcgaggcagccgcaaaagaagcctcacagtgtgtctaat gggtctccagtttgcatgtctaaacttactaaatctcttcctgcttcaccttccacttca gacttttgccagacacgttcctgcatatctgaacatagttcaatcaatgtactgaatata actgctgaagaaattaaaccaaatcaggttggaggagaatgtacaaatcaaaaatggagt gtggcatcaagacctggatcacaggaaggttgctatagtggatgctccttgagctacaca aattctcatgtagaaaaagatgacttaccttag >gi568815597r:84547436_84771219|GENSCAN_predicted_peptide_5|352_aa MCESTLSSIPKSSSPVLEWKSRLRQIPPIRYTSSTCSTYVRIPMSKGNAVFIPEGMSVGI WPIRKTGNRNTEARLFPNYHQGQSHGVLADLAKVDSPPGLMNNPTLRGFGAPRVRALRRR SSANASESGPGASTYLPPPPPGAARVDARPHPATAPTAARSLPPNNGALSQRPGPGLCGG RDSAGTGVVATLEHVFRAPLNPNTQPSSKSEASRRATCVCQSGEINCGHLELGQMPLRTF MGTEGGQNISTLRGVWKKLIKTLIYDFEAFKTSVEEVTGDVVDIAIKLELEVEPEDVTEL LQSQDKTGTDEELLLMDEQRKWYLEGESTPGEDAVNVVQMTTKDLEYSINLN >gi568815597r:84547436_84771219|GENSCAN_predicted_CDS_5|1059_bp atgtgtgaaagtactcttagctcaatcccaaaaagtagcagccctgtccttgagtggaaa tctcgacttagacaaatcccaccaatcaggtacacttctagcacctgttccacatatgta cgcatacccatgagcaagggtaatgcagtcttcatacctgagggaatgtctgtgggaatc tggcccatcaggaagacagggaacagaaatacagaggccagacttttccccaactatcat caagggcaaagccatggagtactggcagacctggcaaaagtggactctccccctggctta atgaacaatccaaccttgagagggtttggggctccccgggtcagagcgctccgcaggcgc agctccgcgaacgccagcgagagcggtcccggggccagcacctacctcccacctcctcct ccaggagcggcccgggtcgacgctcgcccgcaccctgcaacggccccaacagccgcgcgg agcctcccgccgaacaacggggcgctgagccagcgtccggggccggggctctgtgggggc cgggactctgcagggaccggggtcgtggctactttagagcacgttttccgagccccactg aaccccaacacccagcctagcagcaaaagtgaagcttccaggagagccacgtgcgtttgc caaagtggagaaataaactgcgggcacctagaattaggtcagatgccattgagaacattc atgggcacggaaggaggtcaaaatatatcaacattaagaggagtttggaagaagttgatt aaaaccctcatatatgactttgaggcgttcaagacttcagtggaagaagtaactggagat gtggtagacatagcaataaaactagaactagaagtggagcctgaagatgtgactgaattg cttcaatctcaggataaaactggaacagatgaagagctacttcttatggatgaacaaaga aagtggtatcttgagggggaatctactcctggtgaagatgctgtgaacgttgttcaaatg acaacaaaggatttagaatattccataaacttaaactga >gi568815597r:84547436_84771219|GENSCAN_predicted_peptide_6|80_aa METSFVKFMLPNQMVSHKSSTSHLTQVVEKNLSQFQDSPELLPVIVSKLQEAPKAPSVGL GGMRGSLSDPQSLLFAFAIC >gi568815597r:84547436_84771219|GENSCAN_predicted_CDS_6|243_bp atggaaacctcttttgtcaagttcatgttaccaaatcaaatggtcagtcataagtcctca acttctcatcttactcaggtggtagaaaaaaacctcagtcaattccaggattctcctgaa cttttacctgtaattgtatctaaattgcaagaggctcctaaagctccaagtgttggactt ggaggtatgcggggaagtctatctgatcctcaaagtcttctgttcgcatttgccatctgc tga >gi568815597r:84547436_84771219|GENSCAN_predicted_peptide_7|77_aa MAEGEGEAGTSYMAGEGGRETEGKAHTVSSVLGGDAETPSPDAKEELISQSAESTGAVRE LLTGTGPGTGNLGTCYH >gi568815597r:84547436_84771219|GENSCAN_predicted_CDS_7|234_bp atggcagaaggtgaaggggaagcaggcacatcctacatggctggagaaggaggaagagag acagaagggaaagcacatactgtgagcagtgtcctaggaggtgatgccgaaacgcccagc cctgacgctaaagaggaattgatttctcaaagtgcagagtcaacaggtgccgtaagagaa ctgctgacaggaacaggccctgggaccgggaatttgggaacctgctaccattaa >gi568815597r:84547436_84771219|GENSCAN_predicted_peptide_8|405_aa MAPKDVHVLNPGFSEYVTLHDKRGFTDALKDFKIRKLSREQLGHFEPQGLPASSAQIAGE GEVGVYYLCSPFKTNPSKEQSLPSAGIGGSSDIWRSPVLAAEEWASGLREGTACSHLLGA SSFHSELHSNCETGPREGGGDRREIASRIRRAVDLEGIGEECTKKRDSEAHQGCCTMREE VVARDEQTWDLEAGTALPTLVTTTISTGTQQVLNKSWMGGWMHGKMGRLRCLIEAFALLM ECEACRTFSLPKTPESFCGRFKEQFILNTEILANAGITELGHKKVPDPTAHCPQSPQEQS QTGDDPLWFRVSTQELAYDQSANRHTTREYSKVGGRETSEKAYAISPVSRAALALLLSFP HPQRLRDRWARTLPRIITSRSSPLFGLKVLLNLESHEISQQILAV >gi568815597r:84547436_84771219|GENSCAN_predicted_CDS_8|1218_bp atggcccccaaagatgtccatgtcctaaaccctggattcagcgaatatgttaccttacat gacaaaaggggctttactgatgcgcttaaggattttaagatacggaaattgtccagggaa caattggggcactttgagccacaaggactgccggcaagctctgcccagattgctggggaa ggagaggtgggtgtgtattatttatgttcccctttcaaaacaaacccatcaaaagaacaa agcctcccttctgctggcatcggcggcagctctgacatatggcgctccccagttcttgct gccgaggagtgggcgtcggggctgcgtgaaggtactgcctgcagccatctgctgggtgcc agttctttccactccgagctgcattctaattgtgaaacagggccacgggaaggagggggt gaccgacgagagatagcaagcaggataagaagggccgtggacctggaagggataggtgaa gagtgtacgaagaagagggacagcgaagcccaccagggctgctgcacaatgagggaagag gttgtggccagggatgagcagacttgggatctagaggcaggcactgccctacccacctta gtaaccaccacgattagcacaggaacacagcaggttctcaataagagctggatgggagga tggatgcatgggaaaatgggcaggctcaggtgtctcatagaagcctttgcattattgatg gaatgtgaggcatgccgcacattttcacttccaaagacccctgaatctttctgtggaaga ttcaaagaacaatttattttaaacactgaaatcttagcaaatgctggcatcactgagctt ggtcataaaaaagtccctgacccaactgctcactgtccccagagtcctcaggagcagagc caaactggagatgacccactctggttcagggtgagcacacaggaacttgcttatgatcag tcagctaacagacataccactagagaatacagcaaggttggaggtagagagacaagtgag aaagcttatgccatcagtccagtctcccgtgcggccctggcccttctgctgagctttccg catccacaacggctccgcgatcggtgggcccggaccttgccacgcatcatcactagccgc agctctcctttatttgggctgaaagtattgttgaaccttgaatcccatgagatttctcag cagattttggctgtgtga