GENSCAN 1.0 Date run: 8-Nov-116 Time: 11:42:59 Sequence gi568815597f:99893555_100111533 : 217979 bp : 38.10% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 7082 7307 226 2 1 99 63 53 0.238 0.96 1.02 Intr + 9129 9240 112 2 1 60 87 135 0.654 9.63 1.03 Intr + 15511 15795 285 2 0 67 81 137 0.258 7.29 1.04 Intr + 17158 17293 136 2 1 93 67 117 0.988 8.81 1.05 Intr + 18851 18963 113 2 2 24 103 71 0.995 1.30 1.06 Intr + 19973 20184 212 0 2 98 66 118 0.965 8.31 1.07 Intr + 21835 21932 98 0 2 48 89 68 0.684 0.79 1.08 Intr + 22856 22943 88 2 1 60 58 66 0.372 -0.15 1.09 Intr + 23044 23177 134 0 2 56 107 42 0.464 1.42 1.10 Term + 42667 42823 157 1 1 55 54 126 0.324 2.32 1.11 PlyA + 43624 43629 6 1.05 2.00 Prom + 66762 66801 40 -3.65 2.01 Init + 68034 68039 6 2 0 71 105 17 0.462 1.63 2.02 Intr + 71820 72030 211 2 1 38 106 115 0.778 5.86 2.03 Intr + 76114 76316 203 2 2 90 96 185 0.797 17.58 2.04 Intr + 76396 76608 213 0 0 -63 105 375 0.295 22.09 2.05 Intr + 99983 100187 205 1 1 101 42 114 0.877 5.95 2.06 Intr + 105707 105861 155 0 2 102 80 69 0.991 6.37 2.07 Intr + 113480 113602 123 1 0 113 93 13 0.937 4.16 2.08 Intr + 116560 116691 132 0 0 32 94 68 0.764 1.72 2.09 Intr + 117811 117979 169 0 1 83 111 64 0.989 6.80 2.10 Intr + 121748 121866 119 0 2 71 103 10 0.501 0.06 2.11 Term + 122419 122478 60 0 0 72 33 71 0.384 -3.17 2.12 PlyA + 123441 123446 6 1.05 3.03 PlyA - 123870 123865 6 1.05 3.02 Term - 145461 144841 621 0 0 78 44 488 0.287 36.82 3.01 Init - 149455 149450 6 1 0 94 94 0 0.268 2.13 3.00 Prom - 151327 151288 40 -5.15 4.00 Prom + 153418 153457 40 -3.55 4.01 Init + 160238 160240 3 1 0 89 89 0 0.492 0.25 4.02 Intr + 165112 165174 63 0 0 102 97 68 0.984 7.10 4.03 Intr + 166324 166451 128 0 2 71 80 45 0.995 0.56 4.04 Intr + 168281 168395 115 2 1 127 84 51 0.996 8.13 4.05 Intr + 174423 174641 219 2 0 107 115 176 0.999 19.78 4.06 Intr + 176060 176131 72 1 0 69 109 21 0.660 0.98 4.07 Intr + 183611 183723 113 1 2 127 111 -8 0.962 3.66 4.08 Intr + 184897 184994 98 1 2 101 96 137 0.986 14.53 4.09 Intr + 186942 187105 164 1 2 31 91 105 0.835 3.87 4.10 Term + 188449 188655 207 0 0 87 49 38 0.758 -3.84 4.11 PlyA + 189785 189790 6 1.05 5.11 PlyA - 190038 190033 6 1.05 5.10 Term - 191880 191774 107 1 2 126 31 50 0.928 0.69 5.09 Intr - 192076 191982 95 0 2 64 115 35 0.936 2.49 5.08 Intr - 194682 194585 98 0 2 106 106 13 0.914 2.79 5.07 Intr - 209529 209401 129 0 0 102 63 52 0.923 4.07 5.06 Intr - 212349 212213 137 1 2 55 83 71 0.877 2.67 5.05 Intr - 213439 213358 82 1 1 63 84 51 0.900 0.49 5.04 Intr - 213999 213820 180 0 0 -21 53 207 0.925 5.44 5.03 Intr - 214163 214074 90 2 0 27 76 97 0.761 1.67 5.02 Intr - 214450 214256 195 1 0 130 19 197 0.999 15.69 5.01 Intr - 216929 216738 192 2 0 57 57 151 0.971 7.67 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:99893555_100111533|GENSCAN_predicted_peptide_1|520_aa XNIILAFAGTLRHGLIPNLLGEGIYARYNCRDAVWWWLQCIQDYCKMVPNGLDILKCPVS RMYPTDDSAPLPAGTLDQPLFEVIQEAMQKHMQGIQFRERNAGPQIDRNMKDEVGSVRGT PSLLGRAWKSGLSTQSLLTACCQERGALSLTTCFHWGWKSILPTQFVLTSCSRGGKHPEL PAATPRPPTTARLGVEVHTPQIPLVVPAGFNITAGVDEETGFVYGGNRFNCGTWMDKMGE SDRARNRGIPATPRDGSAVEIVGLSKSAVRWLLELSKKNIFPYHEVTVKRHGKAIKVSYD EWNRKIQDNFEKLFHVSEDPSDLNEKHPNLVHKRGIYKDSYGASSPWCDYQLRPNFTIAM VVAPELFTTEKAWKALEIAEKKLLGPLGMKTLDPDDMVYCGIYDNALDNDNYNLAKGFNY HQGPEWLWPIGYFLRAKLYFSRLMGPETTAKTIVLVKNVLSRHYVHLERRRARELPNAGI SHNTSRPSGQGASSSRPRAKQGPGSQGAAPAVDVLSQQSG >gi568815597f:99893555_100111533|GENSCAN_predicted_CDS_1|1563_bp nngaatattattttagcatttgcgggtaccctgaggcatggtctcattcctaatctactg ggtgaaggaatttatgccagatacaattgtcgggatgctgtgtggtggtggctgcagtgt atccaggattactgtaaaatggttccaaatggtctagacattctcaagtgcccagtttcc agaatgtatcctacagatgattctgctcctttgcctgctggcacactggatcagccattg tttgaagtcatacaggaagcaatgcaaaaacacatgcagggcatacagttccgagaaagg aatgctggtccccagatagatcgaaacatgaaggacgaagttgggagtgtgaggggcacc ccatcactgctgggcagggcatggaaatccggactctccactcagtctctgctgacagca tgctgtcaggagaggggagcactgtccctaacgacctgcttccactgggggtggaagtct atactccccacccagtttgtgctaacatcatgcagcaggggtgggaaacaccctgagctg cctgctgccaccccaaggccacctactactgccaggttgggtgtggaagttcatactccc cagattcccctcgtggtgcccgcaggttttaatataactgcaggagttgatgaagaaaca ggatttgtttatggaggaaatcgtttcaattgtggcacatggatggataaaatgggagaa agtgacagagctagaaacagaggaatcccagccacaccaagagatgggtctgctgtggaa attgtgggcctgagtaaatctgctgttcgctggttgctggaattatccaaaaaaaatatt ttcccttatcatgaagtcacagtaaaaagacatggaaaggctataaaggtctcatatgat gagtggaacagaaaaatacaagacaactttgaaaagctatttcatgtttccgaagaccct tcagatttaaatgaaaagcatccaaatctggttcacaaacgtggcatatacaaagatagt tatggagcttcaagtccttggtgtgactatcagctcaggcctaattttaccatagcaatg gttgtggcccctgagctctttactacagaaaaagcatggaaagctttggagattgcagaa aaaaaattgcttggtccccttggcatgaaaactttagatccagatgatatggtttactgt ggaatttatgacaatgcattagacaatgacaactacaatcttgctaaaggtttcaattat caccaaggacctgagtggctgtggcctattgggtattttcttcgtgcaaaattatatttt tccagattgatgggcccggagactactgcaaagactatagttttggttaaaaatgttctt tcccgacattatgttcatcttgagaggagaagagctcgggaactgcctaatgcgggtata agtcataacacaagcaggccgagtgggcagggtgcctccagcagcaggcccagggccaag caaggcccaggcagtcagggggcagcaccagctgtggatgtcctcagtcagcaaagtggc tga >gi568815597f:99893555_100111533|GENSCAN_predicted_peptide_2|531_aa MLTLEHQVITEALINKPHRVAIMQGSFSSFRMYRGRKGPHLPEISSWAPDSKNKSTIGKN QKPLKYSSSEMKRFFAPKDFALPYAVDTLAPSANRQSRRGPRRVSLHLFRAPRRGVAESS APGLHATVKQEPGAASRAAQPRRKPEHLGDGTAGTRKRLGCTELAVATATGAGGAAGQRS RRTQRTRQPGAPGGAEPRPPVVGSAGRAAREANEDKTMFANLKYVSLGILVFQTTSLVLT MRYSRTLKEEGPRYLSSTAVVVAELLKIMACILLVYKDSKCSLRALNRVLHDEILNKPME TLKLAIPSGIYTLQNNLLYVALSNLDAATYQVTYQLKILTTALFSVSMLSKKLGVYQWLS LVILMTGVAFVQPLKYSDDPIEGEDTEHFHHPKSQLTRINIKIGGENTEKSRLFQEWPSD SQLDSKELSAGSQFVGLMAVLTACFSSGFAGVYFEKILKETKQSVWIRNIQLGFFGSIFG LMGVYIYDGELVSKNGFFQGYNRLTWIVVVLQRYLKPRKQGSRKEGEDGKG >gi568815597f:99893555_100111533|GENSCAN_predicted_CDS_2|1596_bp atgctgactttagagcatcaagtgataactgaagctttgatcaacaagccccatagggta gcaatcatgcagggtagcttctccagtttccgaatgtacagagggaggaaagggcctcat ctccctgaaatctcctcctgggcccctgactcgaagaataagtctactataggaaaaaac cagaagccactgaaatactcttcctcagaaatgaaaaggttctttgccccaaaggacttt gcccttccttacgcagtggacaccttggcaccctctgcaaacagacagtcccgcagaggc cctcgacgagtgtcactccatctgtttcgcgctcctaggagaggtgtagcagaaagcagt gccccaggcctccacgcgacggtgaagcaggaaccaggggctgccagcagagcagcccag ccgaggcggaagccggagcatttgggggacgggaccgcgggaacccggaaacgcctgggc tgcaccgagctggcggtggctacggcgacgggagccggcggcgctgcgggtcagcggtcg cgtaggacccagcggactcggcagcctggggcgcccggcggagctgaaccgcggcccccg gtggtgggctcagccggtcgagctgcgcgggaggcaaatgaagataaaacaatgttcgcc aacctaaaatacgtttccctgggaattttggtctttcagactaccagtttggttctaaca atgcgttattccagaactttaaaagaagaaggacctcgttatctatcttctacagcagtg gttgttgctgaacttttgaagataatggcctgcattttattggtctacaaagacagcaaa tgtagtctaagagcactgaatcgagtactacatgatgaaattcttaataaacctatggaa acacttaaacttgctattccatcagggatctatactcttcagaataatttactgtatgtg gcactatcaaatctagatgcagctacttatcaggtcacgtatcagttaaaaattcttaca acagcattattttctgtgtctatgcttagtaaaaaattgggtgtataccagtggctgtcc ctagtaattttgatgacaggagttgcttttgtacagccacttaaatatagcgatgaccct attgagggtgaagatacagaacatttccatcatccaaagagtcagttgactagaataaat ataaagattggaggagagaacacggaaaagagcaggttgtttcaagagtggccctcagat tctcagcttgattctaaggaactttcagctggttctcaatttgtaggactcatggcagtt ctcacagcatgtttttcaagtggctttgctggggtttactttgagaaaatcttaaaagaa acaaaacaatcagtgtggataagaaatattcagcttggtttctttggaagtatatttgga ttaatgggtgtatacatttatgatggagaactggtatcaaagaatggattttttcaggga tataaccgactgacctggatagtagttgttcttcagagatatttgaagcctaggaagcaa ggctccagaaaggaaggtgaagatggcaaaggatag >gi568815597f:99893555_100111533|GENSCAN_predicted_peptide_3|208_aa MEAMTDSNTFDLKTHSDDFSLKTSRFECRPQVSAVMLPTPGSEGSPSPVRVYCKQPRAGA PTQRDVETESERTTGPQRDPRARRHPVSNIWESRDGSQPLDKRGDGSAGPGGEGEARLRR RGARRAGRRGSPAHSSPRRRRTWGSRPQGLQRRPEQRAAGAGKKEDGPRRPQARAAEDDS RSGGMRVRDPEAPAARTGWGKGGRHRDA >gi568815597f:99893555_100111533|GENSCAN_predicted_CDS_3|627_bp atggaggccatgactgattctaatacttttgatcttaagacacacagtgatgacttttct ttgaaaactagcaggttcgaatgtaggccccaggtgtctgccgtcatgcttccgactccg ggaagtgagggaagcccatctccggtcagggtttactgtaagcagcccagagccggagca ccgacgcagagagacgtggaaacagaaagcgaacgtactacggggccgcagcgggatccc cgcgccagacgccaccccgttagcaacatatgggagagcagggatggttcccagcccctt gacaagcgcggcgacggctccgcggggcctggcggcgaaggcgaggcgaggctgcggcga cgaggggcgcggcgggcgggccgccgggggagccccgcacattcctccccgcggcggagg cgcacgtggggttcgaggccgcagggcctgcagaggcggccagagcagagggcggcaggt gcgggaaagaaagaggacgggccccggaggccgcaagcccgagcagcggaggacgattcc cggagcggcggcatgcgggtccgcgaccccgaggcaccggcagctcggacagggtggggg aagggagggaggcaccgggacgcctag >gi568815597f:99893555_100111533|GENSCAN_predicted_peptide_4|393_aa MVLHETFPKHTFLMNGLIQGVKGLLSFLSAPLIGALSDVWGRKSFLLLTVFFTCAPIPLM KISPWWYFAVISVSGVFAVTFSVVFAYVADITQEHERSMAYGLVSATFAASLVTSPAIGA YLGRVYGDSLVVVLATAIALLDICFILVAVPESLPEKMRPASWGAPISWEQADPFAIMKF SPESVAAFIAVLGILSIIAQTIVLSLLMRSIGNKNTILLGLGFQILQLAWYGFGSEPWMM WAAGAVAAMSSITFPAVSALVSRTADADQQGVVQGMITGIRGLCNGLGPALYGFIFYIFH VELKELPITGTDLGTNTSPQHHFEQNSIIPGPPFLFGACSVLLALLVALFIPEHTNLSLR SSSWRKHCGSHSHPHNTQAPGEAKEPLLQDTNV >gi568815597f:99893555_100111533|GENSCAN_predicted_CDS_4|1182_bp atggtattacatgaaacctttcctaaacatacatttctgatgaacggcttaattcaagga gtaaagggtttgttgtcattccttagtgccccgcttattggtgctctttctgatgtttgg ggccgaaaatccttcttgctgctaacggtgtttttcacatgtgccccaattcctttaatg aagatcagcccatggtggtactttgctgttatctctgtttctggggtttttgcagtgact ttttctgtggtatttgcatacgtagcagatataacccaagagcatgaaagaagtatggct tatggactggtttcagcaacatttgctgcaagtttagtcaccagtcctgcaattggagct tatcttggacgagtatatggggacagcttggtggtggtcttagctacagcaatagctttg ctagatatttgttttatccttgttgctgtgccagagtcgttgcctgagaaaatgcggcca gcatcctggggagcacccatttcctgggaacaagctgacccttttgcgataatgaaattt tcaccagaaagtgttgcagcgtttatagcagtccttggcattctttccattattgcacag accatagtcttgagtttacttatgaggtcaattggaaataagaacaccattttactgggt ctaggatttcaaatattacagttggcatggtatggctttggttcagaaccttggatgatg tgggctgctggggcagtagcagccatgtctagcatcacctttcctgctgtcagtgcactt gtttcacgaactgctgatgctgatcaacagggtgtcgttcaaggaatgataacaggaatt cgaggattatgcaatggtctgggaccggccctctatggattcattttctacatattccat gtggaacttaaagaactgccaataacaggaacagacttgggaacaaacacaagccctcag caccactttgaacagaattccatcatccctggccctcccttcctatttggagcctgttca gtactgctggctctgcttgttgccttgtttattccggaacataccaatttaagcttaagg tccagcagttggagaaagcactgtggcagtcacagccatcctcataatacacaagcgcca ggagaggccaaagaacctttactccaggacacaaatgtgtga >gi568815597f:99893555_100111533|GENSCAN_predicted_peptide_5|434_aa AQVQYQQQHEQQKKDLEILHQQNIHQLQNRLSELEAANKDLTERKYKGDSTIRELKAKLS GVEEELQRTKQEVLSLRRENSTLDVECHEKEKHVNQLQTKVAVLEQEIKDKDQLVLRTKE AFDTIQEQKVVLEENGEKNQVQLGKLEATIKSLSAELLKANEIIKKLQGDLKTLMGKLKL KNTVTIQQEKLLAEKEEKLQKEQKELQDVGQSLRIKEQEVCKLQEQLEATVKKLEESKQL LKNNEKLITWLNKELNENQLVRKQDVLGPSTTPPAHSSSNTIRSGISPNLNVVDGRLTYP TCGIGYPVSSAFAFQNTFPHSISAKNTSHPGSGTKVQFNLQFTKPNASLGDVQSGATISM PCSTDKENGENVGLESKYLKKREDSIPLRGLSQNLFSNSDHQRDGTLGALHTSSKPTALP SASSAYFPGQLPNS >gi568815597f:99893555_100111533|GENSCAN_predicted_CDS_5|1305_bp gcacaggttcaatatcaacagcagcatgaacaacagaaaaaagatttagaaatcctccat caacaaaacatccaccagctacaaaacagactgtctgagttagaagcggctaataaagac ttaaccgaaagaaaatataaaggagactccactattagagaacttaaagcaaaactttct ggtgttgaagaggagctacagcggactaagcaagaagtcctctctttgcgaagagagaat tctacactagatgttgaatgccacgagaaagaaaagcacgttaatcagctacaaacaaaa gtggcagttttagaacaggaaatcaaggataaggaccagcttgttttaagaacaaaagag gcatttgatacaatccaggaacaaaaggtggttttagaagaaaatggtgagaaaaatcaa gtacaactaggaaagcttgaagctacaataaaatcattatctgcagaacttctgaaggca aatgaaattatcaagaagttacaaggggatctgaaaactttaatgggtaagttgaaattg aagaatacagttactattcagcaagaaaaactcttggctgagaaggaggaaaaattacaa aaggaacaaaaggaattacaagatgttggacagtctcttcgaattaaagagcaagaggta tgcaaattacaagaacaattagaagctacagttaaaaaacttgaagaaagcaaacaactt ctaaaaaataatgaaaagttaatcacgtggttaaataaagaactaaatgaaaatcagcta gtgagaaagcaagatgtattgggaccttctactactccgcctgcacattccagcagcaac acaatcagaagtggaatttctcctaacctgaatgtggttgatggtagactgacttaccca acctgtgggattggttatcctgtctcctctgcatttgcattccagaataccttccctcat tcgatatctgccaaaaataccagccaccctggttcaggaacaaaggttcagtttaatttg cagtttacaaaaccaaatgcatcactaggagatgttcagtcaggagcaactattagtatg ccttgctcaactgataaggaaaatggtgaaaatgtagggttggaatccaaatacctgaag aaaagggaagatagcattcctttacgcggactcagccagaacctatttagtaattcagac catcagagagatggcactttaggagcattacatacatcttccaaacccacagcgctcccc tctgcgtcttcagcctatttccctgggcagttaccaaacagttaa