GENSCAN 1.0 Date run: 3-Nov-116 Time: 18:17:34 Sequence gi568815581f:37323509_37576979 : 253471 bp : 43.36% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 Intr - 6917 6665 253 0 1 116 116 199 0.913 21.99 1.02 Intr - 16342 16312 31 1 1 137 61 19 0.222 1.80 1.01 Init - 35878 35678 201 1 0 45 59 167 0.159 6.54 1.00 Prom - 41866 41827 40 -2.26 2.00 Prom + 42003 42042 40 -7.76 2.01 Init + 52585 52642 58 0 1 90 95 65 0.989 8.87 2.02 Intr + 54371 54457 87 0 0 117 97 26 0.986 6.34 2.03 Intr + 55629 55874 246 1 0 108 100 20 0.825 2.63 2.04 Intr + 65162 65286 125 0 2 114 95 41 0.991 7.70 2.05 Term + 68139 68216 78 2 0 111 48 75 0.992 3.56 2.06 PlyA + 69817 69822 6 1.05 3.00 Prom + 71387 71426 40 -5.06 3.01 Init + 76199 76244 46 1 1 74 22 49 0.030 -2.26 3.02 Intr + 100001 100107 107 0 2 95 93 73 0.706 8.43 3.03 Intr + 103442 103501 60 1 0 95 50 68 0.774 2.63 3.04 Intr + 114230 114321 92 1 2 87 91 74 0.897 6.39 3.05 Intr + 116997 117154 158 0 2 72 93 179 0.999 16.45 3.06 Intr + 119056 119144 89 2 2 62 75 69 0.843 2.69 3.07 Intr + 121188 121260 73 2 1 62 47 92 0.420 1.48 3.08 Intr + 141923 142033 111 0 0 80 53 149 0.479 10.95 3.09 Intr + 146892 147024 133 1 1 98 84 130 0.960 13.30 3.10 Intr + 147586 147629 44 1 2 115 91 -17 0.919 -0.82 3.11 Intr + 149650 149675 26 2 2 77 77 17 0.624 -2.76 3.12 Intr + 150224 150293 70 1 1 120 97 8 0.632 3.65 3.13 Intr + 151048 151121 74 2 2 121 77 17 0.967 3.03 3.14 Term + 153289 153474 186 0 0 88 48 90 0.975 2.49 3.15 PlyA + 153895 153900 6 1.05 4.00 Prom + 154980 155019 40 -6.06 4.01 Init + 165379 165447 69 0 0 48 105 52 0.009 3.95 4.02 Intr + 176493 176636 144 2 0 90 36 71 0.001 2.58 4.03 Intr + 183279 183388 110 2 2 62 84 3 0.005 -3.62 4.04 Intr + 188673 188777 105 0 0 89 27 74 0.500 0.73 4.05 Term + 188878 189361 484 1 1 89 39 607 0.802 50.12 4.06 PlyA + 189632 189637 6 1.05 5.13 PlyA - 190835 190830 6 1.05 5.12 Term - 195563 195432 132 2 0 89 47 138 0.998 7.89 5.11 Intr - 197140 197030 111 1 0 100 114 24 0.989 6.78 5.10 Intr - 212619 212471 149 1 2 99 72 150 0.790 14.45 5.09 Intr - 214912 214816 97 1 1 100 77 34 0.982 3.08 5.08 Intr - 217035 216872 164 1 2 110 78 155 0.686 16.39 5.07 Intr - 219057 218464 594 1 0 102 113 321 0.961 28.54 5.06 Intr - 230551 229607 945 2 0 86 94 732 0.904 64.72 5.05 Intr - 237749 237687 63 0 0 114 97 18 0.939 3.99 5.04 Intr - 238081 237963 119 0 2 90 83 72 0.990 7.01 5.03 Intr - 245416 245283 134 1 2 57 75 90 0.991 4.04 5.02 Intr - 247377 247129 249 0 0 82 99 51 0.460 3.23 5.01 Intr - 248479 248283 197 2 2 81 97 85 0.657 7.83 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:37323509_37576979|GENSCAN_predicted_peptide_1|162_aa MADPRSDHRHRLTSLRPAPPSHWLPLSPRARASPPGGPGSAAARSGVGGERPGVTPDVVT SWSSPAEVFLEVDIYSDTVRAHFGGIMDEPSPLAQPLELNQHSRFIIGSVSEDNSEDEIS NLVKLDLLEEKEGSLSPASVGSDTLSDLGISSLQDGLALHIS >gi568815581f:37323509_37576979|GENSCAN_predicted_CDS_1|486_bp atggcagatccgcgctcggaccatcggcatcgcctcacatcgctccgccccgcgccgccc tcccattggctgccgctgagccctcgggcccgcgcctcgccccccggcggccctggcagc gccgcagcccggagcggggtcggaggtgaacggcctggagtaaccccggacgtagtcacc tcatggagctcgccggctgaggtctttttggaagtggatatctactcagacactgtaaga gctcattttggaggaataatggatgaaccatctcccttggcccaacctctggagctgaac cagcactctcgattcataataggttctgtgtctgaagataactcagaggatgagatcagc aacctggtgaagttggacctactggaggagaaggagggctccttgtcacctgcttctgtt ggctcagatacactctctgatttggggatctctagcctacaggatggcttggccttgcac ataagn >gi568815581f:37323509_37576979|GENSCAN_predicted_peptide_2|197_aa MDTILVFSLIIASYDANKKDLRDSSCRLEQLPGIFPKDVRSIRELQMQETHTETKRTTFI QNRTIATLQCLGSDSKVKVNLVYLERRPKVKHILKNLRIIAAPRRNSSASSSCHLIPTSK FQTGSLLKGKDTDENLEKRQKWSIVVKILIAVTLLLSGVAIIVFVIFEVPCPVGQDAANS SNPKKAAEITVIHQTYF >gi568815581f:37323509_37576979|GENSCAN_predicted_CDS_2|594_bp atggataccatcttggtcttcagcctaatcattgcatcctatgatgccaacaagaaagac ctcagagatagcagttgccgactggaacagctgcctgggatcttcccaaaagacgtgaga agcatcagagaattgcaaatgcaagaaactcacacagaaaccaaaaggacaacattcatt caaaaccggactatagctaccctgcagtgccttggctctgacagcaaagtaaaagtcaac cttgtatatttggagagaaggccaaaggtcaagcatattttgaagaacctgagaatcatt gctgctccccgcagaaacagctctgcctcctcaagctgtcacctaatccccacatccaag tttcagactggatctcttctaaaaggcaaagacacagatgagaacctagagaagagacag aaatggagtattgtggtcaaaattctgattgctgtcaccctgttgctcagtggagttgcc attatagtatttgtaatttttgaagtcccatgtcctgttggacaagatgctgccaattca tcaaacccaaagaaagctgcagagatcactgttatccaccagacatacttctga >gi568815581f:37323509_37576979|GENSCAN_predicted_peptide_3|422_aa MGNEGLDSRKLFNGHDDPSDKPPCRGCSSYLMEPYIKCAECGPPPFFLCLQCFTRGFEYK KHQSDHTYEIMTSDFPVLDPSWTAQEEMALLEAVMDCGFGNWQDVANQMCTKTKEECEKH YMKHFINNPLFASTLLNLKQAEEAKTADTAIPFHSTDDPPRPTFDSLLSRDMAGYMPARA DFIEEFDNYAEWDLRDIDFVEDDSDILHVMERRYPKEVQDLYETMRRFARIVGPVEHDKF IESHACARTYDHLKKTREEERLKRTMLSEVLQYIQDSSACQQWLRRQADIDSGLSPSIPM ASNSETRSYYITQVCELTSAPALPYNGSLFLFVDLAGRRSAPPLNLTGLPGTEKLNEKEK ELCQMVRLVPGAYLEYKSALLNECNKQGGLRLAQARALIKIDVNKTRKIYDFLIREGYIT KG >gi568815581f:37323509_37576979|GENSCAN_predicted_CDS_3|1269_bp atgggtaacgaaggcctggattcacgaaaactgtttaatggccatgatgatccctctgat aagccaccttgccgaggctgctcctcctacctcatggagccttatatcaagtgtgctgaa tgtgggccacctccttttttcctctgcttgcagtgtttcactcgaggctttgagtacaag aaacatcaaagcgatcatacttatgaaataatgacttcagattttcctgtccttgatccc agctggactgctcaagaagaaatggcccttttagaagctgtgatggactgtggctttgga aattggcaggatgtagccaatcaaatgtgcaccaagaccaaggaggagtgtgagaagcac tatatgaagcatttcatcaataaccctctgtttgcatctaccctgctgaacctgaaacaa gcagaggaagcaaaaactgctgacacagccattccatttcactctacagatgaccctccc cgacctacctttgactccttgctttctcgggacatggccgggtacatgccagctcgagca gatttcattgaggaatttgacaattatgcagaatgggacttgagagacattgattttgtt gaagatgactcggacattttacatgtaatggaacggcggtatcccaaggaggtccaggac ctgtatgaaacaatgaggcgatttgcaagaattgtggggccagtggaacatgacaaattc attgaaagccatgcatgtgccagaacctacgatcacctcaagaagacacgggaggaagag cgccttaaacgcactatgctctcagaagttctccagtatatccaggacagtagtgcttgc cagcagtggctccgccggcaagctgacattgattccggcctgagtccttccattccaatg gcttcgaattcagaaacaaggtcttactatatcacccaggtgtgtgagttgacctcagcc cctgctttgccatataatggctcattgttcctgtttgtggatcttgcaggtagacggagt gcaccacccttgaacctcactggcctccctggcacagagaagctgaatgaaaaagaaaag gagctctgtcagatggtgaggttggtccctggagcctatttagaatacaaatctgctcta ttgaacgaatgtaacaagcaaggaggcttaagactggcgcaggcaagagcactcatcaag atagatgtgaacaaaacccggaaaatctatgatttcctcatcagagaaggatacatcact aaaggctaa >gi568815581f:37323509_37576979|GENSCAN_predicted_peptide_4|303_aa MPTRFDCSAILRPTICNGEEQGLDGTDTQLPSQCSRTALSMGAGMAFASCLHPAEAEDSA VYLTLLGNSSVPGGCFVHQMVYQDVAVVPCSGTRGNWSSSAHLGLAFLKETLILDSLVEE NKTLWSCRQRCKCDCWRLHELQRGSVASNRHLLQARGITCIVNATIEIPNFNWPQFEYVK VPLADMPHAPIGLYFDTVADKIHSVSRKHGATLVHCAAGVSRSATLCIAYLMKFHNVCLL EAYNWVKARRPVIRPNVGFWRQLIDYERQLFGKSTVKMVQTPYGIVPDVYEKESRHLMPY WGI >gi568815581f:37323509_37576979|GENSCAN_predicted_CDS_4|912_bp atgcctactcgttttgactgctcagcaattctgaggccaaccatttgcaacggcgaggag caggggctggatggcacagacacccagttgccctcacagtgcagtcggactgctctcagc atgggagcgggcatggccttcgccagttgcctccatcccgctgaagcagaagactcagct gtttacctcaccctgctcggaaactctagcgtgccagggggatgttttgtgcaccaaatg gtgtaccaggatgtggcagtagttccctgttctggcactaggggaaactggagcagctcg gcccatctgggcttagcctttctgaaggagactctaattttggattccttggtggaggaa aataaaacactctggtcttgccgccaacgatgcaagtgtgactgctggcgtcttcatgag ctccagagaggcagtgtggcctccaatcggcacctcctccaggctcgtggcatcacctgc attgttaatgctaccattgagatccctaatttcaactggccccaatttgagtatgttaaa gtgcctctggctgacatgccgcatgcccccattggactgtactttgacaccgtggctgac aagatccacagtgtgagcaggaagcacggggccaccttggtgcactgtgctgcaggggtg agccgctcagccacgctgtgtatcgcgtacctgatgaaattccacaacgtgtgcctgctg gaggcgtacaactgggtgaaagcccggcgacctgtcatcaggcccaacgtaggcttctgg aggcaactgatagactacgagcgccagctctttgggaagtcgacagttaaaatggtacag acaccttatggcatagttcccgacgtctatgagaaggagtcccgacacctgatgccttac tgggggatttag >gi568815581f:37323509_37576979|GENSCAN_predicted_peptide_5|984_aa XAYKKILETTMTPTGIDTAKLYPILMSSGLPRETLGQIWALANRTTPGKLTKEELYTVLA MIAVTQRGVPAMSPDALNQFPAAPIPTLSGFSMTLPTPVSQPTVIPSGPAGSMPLSLGQP VMGINLVGPVGGAAAQASSGFIPTYPANQVVKPEEDDFQDFQDASKSGSLDDSFSDFQEL PASSKTSNSQHGNSAPSLLMPLPGTKALPSMDKYAVFKGIAADKSSENTVPPGDPGDKYS AFRELEQTAENKPLGESFAEFRSAGTDDGFTDFKTADSVSPLEPPTKDKTFPPSFPSGTI QQKQQTQVKNPLNLADLDMFSSVNCSSEKPLSFSAVFSTSKSVSTPQSTGSAATMTALAA TKTSSLADDFGEFSLFGEYSGLAPVGEQDDFADFMAFSNSSISSEQKPDDKYDALKEEAS PVPLTSNVGSTVKGGQNSTAASTKYDVFRQLSLEGSGLGVEDLKDNTPSGKSDDDFADFH SSKFSSINSDKSLGEKAVAFRHTKEDSASVKSLDLPSIGGSSVGKEDSEDALSVQFDMKL ADVGGDLKHVMSDSSLDLPTVSGQHPPAADIEDLKYAAFGSYSSNFAVSTLTSYDWSDRD DATQGRKLSPFVLSAGSGSPSATSILQKKETSFGSSENITMTSLSKVTTFVSEDALPETT FPALASFKDTIPQTSEQKEYENRDYKDFTKQDLPTAERSQEATCPSPASSGASQETPNEC SDDFGEFQSEKPKISKFDFLVATSQSKMKSSEEMIKSELATFDLSVQGSHKRSLSLGDKE ISRSSPSPALEQPFRDRSNTLNEKPALPVIRDKYKDLTGEVEVIKKANDTLNGISSSSVC TEVIQSAQGMEYLLGVVEVYRVTKRVELGIKATAVCSEKLQQLLKDIDKVWNNLIGFMSL ATLTPDENSLDFSSCMLRPGIKNAQELACGVCLLNVDSRSRAFNSETDSFKLAYGGHQYH ASCANFWINCVEPKPPGLVLPDLL >gi568815581f:37323509_37576979|GENSCAN_predicted_CDS_5|2955_bp natgcctataagaaaatcttagaaaccacaatgactccaactggaatagatactgccaaa ctgtatcccattctgatgtcatctgggcttcccagggaaactcttggacagatatgggcc ttagctaatcgaactacacctggcaaacttacaaaagaagaactttataccgttctagcc atgatagcggtaacacagaggggcgttcctgcaatgagtcctgatgctttaaaccagttc ccagcagctcctattccaactttaagtggcttttctatgactctgcctacaccggtgagt cagccaactgtgataccttcaggtcctgcgggctccatgcccctcagccttggacagcca gtcatgggcattaaccttgttggaccagtgggtggagctgcagcccaggcttctagtggt ttcataccaacctaccctgcaaatcaggtagtaaagccagaagaagatgacttccaggat tttcaagatgcttctaagtcaggatcccttgatgactcattcagtgatttccaagagttg cctgcttcttcaaaaacaagtaactcccagcatggaaacagtgccccttctttgttgatg ccacttcctggaactaaagcattgccttcaatggacaaatatgctgtgtttaaaggaatt gcagctgacaagtcctctgaaaatactgttccacctggagatcctggtgataaatatagt gctttcagagaacttgaacagacagcagagaataaacctttaggagaaagctttgcagaa ttcagatctgcaggaactgatgatggtttcaccgattttaaaacagccgatagtgtatca ccactagagccaccaacaaaagacaaaacttttccaccatccttcccctcaggaactata caacagaaacaacaaacacaagtgaaaaaccctctgaacttagcagacctagatatgttt tcctcagttaattgcagcagcgagaaaccattgtctttttcagctgtgtttagcacatca aaatcagtttctacaccacagtcaacaggttctgctgctactatgacagcattggcagca acaaaaacttctagtttggctgatgattttggagaattcagcctttttggggaatattct ggtctagcacctgttggggagcaggatgactttgcagattttatggctttcagtaatagc tctatttcatctgagcaaaagccggatgacaaatatgatgcccttaaagaggaagccagt cctgttcctctaaccagcaacgtgggcagcacagtgaagggtggacaaaactcgactgct gcgtctaccaagtacgatgtcttcagacaactttctctggaagggtctggactaggtgtt gaagacctgaaagataacactccttcaggaaaaagtgatgatgattttgctgacttccac tccagtaaattttcttccataaactcggacaaatccctgggagagaaagcagtggctttc agacacaccaaagaagactctgcatcagtgaagtccttagatctcccttccattggtggc agcagtgttggcaaggaggactctgaagatgcactctctgttcagtttgacatgaaattg gctgatgtgggaggagatcttaagcatgtcatgtctgatagctctttggatttaccaaca gttagtggccagcatcctcctgctgcagatatagaggacttaaaatatgctgcttttgga agctacagtagcaattttgcagtgagcacacttacaagctatgactggtcagacagggat gatgcaactcagggcagaaaactctctccatttgtcctctcagcaggaagtggatccccc tcagccacctcaattcttcaaaagaaagagacttcatttggcagttctgaaaacatcacc atgacatctctctccaaagtaacgacctttgtaagtgaagatgctcttccagagaccacc ttcccagctcttgccagttttaaagacacgattcctcagaccagtgagcaaaaggaatat gaaaacagagactataaagatttcacaaaacaggacctgcctacggctgaacggagccag gaggccacgtgtcccagcccagcgtccagtggtgcctctcaagaaaccccgaacgaatgt tcggatgactttggagagtttcaaagtgaaaagcccaaaatcagcaaatttgacttctta gtagccacttcacaaagcaaaatgaaatccagtgaagaaatgatcaaaagtgagctggca acctttgacctttctgttcaaggatcacacaagaggagtttgagccttggtgataaagaa ataagccgttcttctccttctccagctttggagcagcctttcagagaccgttccaatact ctgaatgagaagcccgccctgcccgtcatccgagacaagtacaaagacctgacgggagag gtggaggtcattaagaaggcaaatgataccttaaatggaatcagtagtagttctgtttgc acagaagtaattcagtcagctcaaggcatggaatatttattaggtgttgttgaagtgtac agggtaaccaagcgtgtggagctggggataaaagccactgcagtgtgcagtgagaaactc cagcagttgctgaaggacatcgataaagtatggaataacctaatcggcttcatgtcactc gccacactcacaccagatgaaaactcgctggatttttcctcctgtatgttacggcctggg attaaaaatgctcaggagcttgcctgtggagtgtgcctcttgaatgtggactcgaggagc cgggcattcaactcagaaacagacagtttcaagctggcctatggagggcaccagtatcac gccagctgtgccaacttctggatcaactgtgtcgaaccaaagcctcctggcctcgtcctg cctgacctgctctga