GENSCAN 1.0 Date run: 5-Nov-116 Time: 07:14:05 Sequence gi568815597f:198539269_198856178 : 316910 bp : 35.73% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 Intr - 1417 1301 117 2 0 43 89 127 0.217 7.82 1.01 Init - 7657 7561 97 1 1 57 72 68 0.157 2.62 1.00 Prom - 27581 27542 40 -0.15 2.00 Prom + 45798 45837 40 -4.55 2.01 Init + 59746 59893 148 0 1 49 81 103 0.098 6.00 2.02 Intr + 67450 67521 72 2 0 67 93 73 0.011 4.16 2.03 Intr + 92173 92456 284 2 2 42 52 160 0.146 4.01 2.04 Term + 97215 97457 243 2 0 52 36 147 0.437 0.82 2.05 PlyA + 98329 98334 6 1.05 3.00 Prom + 99550 99589 40 -6.95 3.01 Init + 100007 100073 67 1 1 62 111 50 0.479 6.01 3.02 Intr + 110696 110823 128 0 2 6 74 155 0.316 5.38 3.03 Intr + 138267 138313 47 2 2 77 98 9 0.007 -2.91 3.04 Intr + 139753 139922 170 1 2 23 57 144 0.051 3.47 3.05 Term + 140644 140852 209 2 2 51 48 179 0.281 6.72 3.06 PlyA + 142322 142327 6 1.05 4.00 Prom + 144179 144218 40 -5.45 4.01 Init + 154320 154329 10 2 1 42 111 0 0.380 -1.64 4.02 Intr + 154750 154872 123 2 0 22 83 113 0.473 3.84 4.03 Intr + 157444 157641 198 2 0 68 52 124 0.491 5.20 4.04 Intr + 160296 160436 141 1 0 108 41 263 0.652 23.00 4.05 Intr + 163119 163262 144 1 0 79 41 89 0.553 2.63 4.06 Intr + 164030 164104 75 0 0 108 27 63 0.558 0.77 4.07 Intr + 165204 165230 27 1 0 103 103 9 0.669 1.07 4.08 Intr + 167466 167684 219 1 0 96 99 68 0.980 6.05 4.09 Intr + 168865 168993 129 2 0 80 83 27 0.688 1.15 4.10 Intr + 170419 170556 138 2 0 78 64 88 0.806 4.91 4.11 Intr + 177414 177572 159 1 0 87 99 63 0.937 6.34 4.12 Intr + 178826 179034 209 0 2 75 93 123 0.970 9.37 4.13 Intr + 188496 188541 46 2 1 91 110 -7 0.936 -1.14 4.14 Intr + 189072 189180 109 1 1 108 99 92 0.997 10.72 4.15 Intr + 189869 189903 35 2 2 44 98 65 0.974 0.05 4.16 Intr + 192349 192458 110 2 2 70 89 115 0.996 8.88 4.17 Intr + 193032 193122 91 2 1 66 70 113 0.996 5.95 4.18 Intr + 195060 195157 98 1 2 91 90 176 0.953 16.91 4.19 Intr + 195859 195984 126 0 0 26 82 130 0.983 6.16 4.20 Intr + 202601 202758 158 1 2 16 91 110 0.962 1.99 4.21 Intr + 202964 203099 136 2 1 56 87 211 0.967 17.45 4.22 Intr + 204786 204935 150 2 0 57 95 101 0.895 7.14 4.23 Intr + 208841 208931 91 1 1 109 50 49 0.799 1.85 4.24 Intr + 210148 210281 134 2 2 21 78 129 0.430 4.64 4.25 Intr + 211224 211358 135 2 0 32 72 141 0.987 6.74 4.26 Intr + 212981 213103 123 1 0 60 86 136 0.994 10.46 4.27 Intr + 213326 213504 179 1 2 63 96 127 0.999 8.80 4.28 Intr + 215001 215136 136 0 1 106 88 80 0.988 9.45 4.29 Term + 216638 216913 276 1 0 82 49 324 0.984 22.38 4.30 PlyA + 217354 217359 6 1.05 5.08 PlyA - 217669 217664 6 1.05 5.07 Term - 236666 236548 119 0 2 83 43 125 0.216 5.22 5.06 Intr - 248013 247985 29 2 2 82 81 38 0.002 -0.66 5.05 Intr - 258549 258263 287 0 2 40 30 201 0.005 4.72 5.04 Intr - 274322 274077 246 2 0 116 68 55 0.005 3.03 5.03 Intr - 287841 287758 84 0 0 60 95 62 0.037 3.20 5.02 Intr - 288451 288287 165 1 0 22 18 190 0.356 4.64 5.01 Init - 289653 289393 261 0 0 52 63 146 0.640 5.61 5.00 Prom - 294808 294769 40 -4.45 6.03 PlyA - 294945 294940 6 1.05 6.02 Term - 300226 300083 144 1 0 82 46 98 0.260 1.93 6.01 Init - 302435 302319 117 2 0 53 9 140 0.393 2.85 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:198539269_198856178|GENSCAN_predicted_peptide_1|72_aa MNQCVGVQELRLGKFKMPFTSPGGYESDIQGRDFTEATCLEQTTMTSQSQGIHQLLQAEK RAKDKLEEAKKX >gi568815597f:198539269_198856178|GENSCAN_predicted_CDS_1|216_bp atgaaccaatgtgttggtgttcaggaattgagactgggaaagttcaagatgcctttcaca tctccaggtggatatgaatctgacattcagggcagagatttcacagaagccacttgcttg gagcagactaccatgacaagccagtctcaggggatccaccagcttcttcaggcagaaaaa cgggccaaggacaagctagaggaagccaagaagann >gi568815597f:198539269_198856178|GENSCAN_predicted_peptide_2|248_aa MVVVKVHGWVIKPWLETQLPISGAVTTAGLMIQGQEQRLREHSRDGRTHAMEEGEKNCHD YSVLIESGFKPIKDMQMAEPRQESHCQKPVQKEMQRQRETERQRFQQPAAIKGIDKCYTW LVTLNVRPEVKHIHLSLDLMEQEKYEIHRRPLQSGRLPSCVKSYKTFSTQLFFAKYPFIS NQPTLPLGISESGSHQYVINTPPVFFESFLLMQAEGLDARPVCDTVEISPDALWKDKLLP GVLPGCLA >gi568815597f:198539269_198856178|GENSCAN_predicted_CDS_2|747_bp atggtggtggtcaaagttcatggttgggttatcaagccatggttagaaacacagctgccg atatctggggctgtaacaactgcaggtctaatgatccaaggccaggaacaaagattaagg gagcactctagggatgggagaacccatgctatggaagaaggagagaaaaactgccatgat tactccgtgttaattgaaagtggttttaaaccaatcaaagatatgcaaatggcagaaccc agacaggaaagtcattgccaaaagccagtgcagaaagagatgcagagacagagagagaca gagagacagaggtttcagcaacctgcagccatcaaaggaatcgacaaatgttatacatgg cttgtaacactgaatgtgaggcctgaagtaaaacacatacatctcagcttagacctcatg gaacaagaaaaatatgaaatccacagaaggccactccagagtggaagactcccttcatgt gtgaaaagttacaagactttctcaactcagttgttctttgcaaaatacccatttatcagc aaccagcccaccctccctctgggaatcagtgagagtggatcacatcagtatgtcataaat actcctcctgttttctttgagagttttctgctgatgcaagcggaaggcttggatgccagg cctgtatgtgatacagtagaaatttccccagatgccctgtggaaagacaagctgctaccc ggagtccttcctggatgtctggcttag >gi568815597f:198539269_198856178|GENSCAN_predicted_peptide_3|206_aa MYLWLKLLAFGFAFLDTEVFVTDRVIGEDPYEEESLSYLLADKNMGIHVRETGKSKGYLA KPNKLTLLILGESSMITLRIGSCMCKGEPEEEKSIIHATLNWRTRLGASGTVQVPELPPT SQTAAPAVWAHSRPTQPGGRQHFAQLVHRRPGVGDMQRVLRLPVHFLQAQGPGPLAEVPG MAVQERSAQLGVPLGKAQAQRRLGSL >gi568815597f:198539269_198856178|GENSCAN_predicted_CDS_3|621_bp atgtatttgtggcttaaactcttggcatttggctttgcctttctggacacagaagtattt gtgacagacagagtgatcggggaagatccttatgaagaagaatcattgagctacctgttg gctgacaagaatatggggatccatgtgagggagactggcaagagtaaaggttatctggca aaaccaaacaaattgacattacttattttgggggagtcttccatgattaccctaagaata gggtcttgcatgtgcaaaggggagccggaagaggagaaatctattatacatgcaacactg aactggagaacacggcttggggcctctgggacagttcaggtccctgagctgccccctact tcccagacagctgctcctgcagtttgggcacatagtcgtcccactcagcctggtgggcgt cagcactttgcccagctcgtccaccgtcgccctggcgtaggcgacatgcagcgagtgctg cggctgcccgtccacttcctgcaggctcaagggcctggaccgcttgctgaggtccctggc atggctgtgcaagaacgcagcgcgcagctgggagtgccactaggcaaggcgcaggcgcag cggcgactcggcagcctctga >gi568815597f:198539269_198856178|GENSCAN_predicted_peptide_4|1234_aa MKPGHLQAEEQGSQSKSPNLKSREADSSAFSWWPKAREPLTNHWRLTTAKMPSVPLSSDP LPTHTTAFSPASTFERENDFSETTTSLSPDNTSTQVSPDSLDNASAFNTTGVSSVQTPHL PTHADSQTPSAGTDTQTFSGSAANAKLNPTPGSNAISDVPGERSTASTFPTDPVSPLTTT LSLAHHSSAALPARTSNTTITANTSDAYLNASETTTLSPSGSAVISTTTIATTPSKPTCD EKYANITVDYLYNKETKLFTAKLNVNENVECGNNTCTNNEVHNLTECKNASVSISHNSCT APDKTLILDVPPGVEKFQLHDCTQVEKADTTICLKWKNIETFTCDTQNITYRFQCGNMIF DNKEIKLENLEPEHEYKCDSEILYNNHKFTNASKIIKTDFGKKDCLNLDKNLIKYDLQNL KPYTKYVLSLHAYIIAKVQRNGSAAMCHFTTKSAPPSQVWNMTVSMTSDNSMHVKCRPPR DRNGPHERYHLEVEAGNTLVRNESHKNCDFRVKDLQYSTDYTFKERAGFHIKLSEYLSQD NSKALIAFLAFLIIVTSIALLVVLYKIYDLHKKRSCNLDEQQELVERDDEKQLMNVEPIH ADILLETYKRKIADEGRLFLAEFQSIPRVFSKFPIKEARKPFNQNKNRYVDILPCPRDET VDDFWRMIWEQKATVIVMVTRCEEGNRNKCAEYWPSMEEGTRAFGDVVVKINQHKRCPDY IIQKLNIVNKKEKATGREVTHIQFTSWPDHGVPEDPHLLLKLRRRVNAFSNFFSGPIVVH CSAGVGRTGTYIGIDAMLEGLEAENKVDVYGYVVKLRRQRCLMVQVEAQYILIHQALVEY NQFGETEVNLSELHPYLHNMKKRDPPSEPSPLEAEFQRLPSYRSWRTQHIGNQEENKSKN RNSNVIPYDYNRVPLKHELEMSKESEHDSDESSDDDSDSEEPSKYINASFIMSYWKPEVM IAAQGPLKETIGDFWQMIFQRKVKVIVMLTELKHGDQEICAQYWGEGKQTYGDIEVDLKD TDKSSTYTLRVFELRHSKRKDSRTVYQYQYTNWSVEQLPAEPKELISMIQVVKQKLPQKN SSEGNKHHKSTPLLIHCRDGSQQTGIFCALLNLLESAETEEVVDIFQVVKALRKARPGMV STFEQYQFLYDVIASTYPAQNGQVKKNNHQEDKIEFDNEVDKVKQDANCVNPLGAPEKLP EAKEQAEGSEPTSGTEGPEHSVNGPASPALNQGS >gi568815597f:198539269_198856178|GENSCAN_predicted_CDS_4|3705_bp atgaaaccaggccatctgcaagctgaggagcaaggaagccaatccaagtcaccaaacctc aaaagtagggaagctgacagttcagccttcagttggtggccaaaggcccgagagcccctc acaaaccactggagattgactacagcaaagatgcccagtgttccactttcaagtgacccc ttacctactcacaccactgcattctcacccgcaagcacctttgaaagagaaaatgacttc tcagagaccacaacttctcttagtccagacaatacttccacccaagtatccccggactct ttggataatgctagtgcttttaataccacaggtgtttcatcagtacagacgcctcacctt cccacgcacgcagactcgcagacgccctctgctggaactgacacgcagacattcagcggc tccgccgccaatgcaaaactcaaccctaccccaggcagcaatgctatctcagatgtccca ggagagaggagtacagccagcacctttcctacagacccagtttccccattgacaaccacc ctcagccttgcacaccacagctctgctgccttacctgcacgcacctccaacaccaccatc acagcgaacacctcagatgcctaccttaatgcctctgaaacaaccactctgagcccttct ggaagcgctgtcatttcaaccacaacaatagctactactccatctaagccaacatgtgat gaaaaatatgcaaacatcactgtggattacttatataacaaggaaactaaattatttaca gcaaagctaaatgttaatgagaatgtggaatgtggaaacaatacttgcacaaacaatgag gtgcataaccttacagaatgtaaaaatgcgtctgtttccatatctcataattcatgtact gctcctgataagacattaatattagatgtgccaccaggggttgaaaagtttcagttacat gattgtacacaagttgaaaaagcagatactactatttgtttaaaatggaaaaatattgaa acctttacttgtgatacacagaatattacctacagatttcagtgtggtaatatgatattt gataataaagaaattaaattagaaaaccttgaacccgaacatgagtataagtgtgactca gaaatactctataataaccacaagtttactaacgcaagtaaaattattaaaacagatttt gggaaaaaagattgcctcaatctggataaaaacctgatcaaatatgatttgcaaaattta aaaccttatacgaaatatgttttatcattacatgcctacatcattgcaaaagtgcaacgt aatggaagtgctgcaatgtgtcatttcacaactaaaagtgctcctccaagccaggtctgg aacatgactgtctccatgacatcagataatagtatgcatgtcaagtgtaggcctcccagg gaccgtaatggcccccatgaacgttaccatttggaagttgaagctggaaatactctggtt agaaatgagtcgcataagaattgcgatttccgtgtaaaagatcttcaatattcaacagac tacacttttaaggaaagggcaggatttcacattaaacttagtgaatatttaagtcaggat aattctaaggcactgatagcatttctggcatttctgattattgtgacatcaatagccctg cttgttgttctctacaaaatctatgatctacataagaaaagatcctgcaatttagatgaa cagcaggagcttgttgaaagggatgatgaaaaacaactgatgaatgtggagccaatccat gcagatattttgttggaaacttataagaggaagattgctgatgaaggaagactttttctg gctgaatttcagagcatcccgcgggtgttcagcaagtttcctataaaggaagctcgaaag ccctttaaccagaataaaaaccgttatgttgacattcttccttgtcccagggatgaaact gttgatgatttctggaggatgatttgggaacagaaagccacagttattgtcatggtcact cgatgtgaagaaggaaacaggaacaagtgtgcagaatactggccgtcaatggaagagggc actcgggcttttggagatgttgttgtaaagatcaaccagcacaaaagatgtccagattac atcattcagaaattgaacattgtaaataaaaaagaaaaagcaactggaagagaggtgact cacattcagttcaccagctggccagaccacggggtgcctgaggatcctcacttgctcctc aaactgagaaggagagtgaatgccttcagcaatttcttcagtggtcccattgtggtgcac tgcagtgctggtgttgggcgcacaggaacctatatcggaattgatgccatgctagaaggc ctggaagccgagaacaaagtggatgtttatggttatgttgtcaagctaaggcgacagaga tgcctgatggttcaagtagaggcccagtacatcttgatccatcaggctttggtggaatac aatcagtttggagaaacagaagtgaatttgtctgaattacatccatatctacataacatg aagaaaagggatccacccagtgagccgtctccactagaggctgaattccagagacttcct tcatataggagctggaggacacagcacattggaaatcaagaagaaaataaaagtaaaaac aggaattctaatgtcatcccatatgactataacagagtgccacttaaacatgagctggaa atgagtaaagagagtgagcatgattcagatgaatcctctgatgatgacagtgattcagag gaaccaagcaaatacatcaatgcatcttttataatgagctactggaaacctgaagtgatg attgctgctcagggaccactgaaggagaccattggtgacttttggcagatgatcttccaa agaaaagtcaaagttattgttatgctgacagaactgaaacatggagaccaggaaatctgt gctcagtactggggagaaggaaagcaaacatatggagatattgaagttgacctgaaagac acagacaaatcttcaacttatacccttcgtgtctttgaactgagacattccaagaggaaa gactctcgaactgtgtaccagtaccaatatacaaactggagtgtggagcagcttcctgca gaacccaaggaattaatctctatgattcaggtcgtcaaacaaaaacttccccagaagaat tcctctgaagggaacaagcatcacaagagtacacctctactcattcactgcagggatgga tctcagcaaacgggaatattttgtgctttgttaaatctcttagaaagtgcggaaacagaa gaggtagtggatatttttcaagtggtaaaagctctacgcaaagctaggccaggcatggtt tccacattcgagcaatatcaattcctatatgacgtcattgccagcacctaccctgctcag aatggacaagtaaagaaaaacaaccatcaagaagataaaattgaatttgataatgaagtg gacaaagtaaagcaggatgctaattgtgttaatccacttggtgccccagaaaagctccct gaagcaaaggaacaggctgaaggttctgaacccacgagtggcactgaggggccagaacat tctgtcaatggtcctgcaagtccagctttaaatcaaggttcatag >gi568815597f:198539269_198856178|GENSCAN_predicted_peptide_5|396_aa MSFVSHIASGKEANIIPTGREAVPMDDPRRDPNDEVEAWKRRHFQECIMEGLCRTRTKPL NYTKLSMIDQRFDGNPTAFLEKIREALPQDPSGQGHKLLDIERNTSAEEDASSWTSRGRR GEQTSGRAHDAGTPAGHQSSAGKYQSTINPSNEKNNNSLKDKVDVEEKKKTSQHLGKIER TYVEILGNGSPDNFVYSHGVNQGLWYCTKQDEKFVRTLRSFFTAIHNLLERQLAPVEMIN VTRWILSGYSLHKWVTGRGCNSLEGSEEDRKICEALNHLRDLLNGFGQNADSDIDNEIQA EVVSDGDVQLLGNWSKGRSCHALAKRLAAFCPCPRDLWNFELQRHKLGCRDERESGEGCI LRLLHLEHSNHEQSSSHDRCPRLVSTDPSSCSARVT >gi568815597f:198539269_198856178|GENSCAN_predicted_CDS_5|1191_bp atgagctttgtatcacatatagcttcagggaaggaggcaaatattatcccaactggaaga gaagcagtaccaatggatgaccctagacgggatcctaatgatgaggtggaagcctggaag aggagacactttcaggagtgcataatggaaggcttatgtagaactaggaccaagcctctc aattacactaagttgtccatgattgaccagagatttgatggaaaccccactgccttcctg gagaagataagagaggccctgcctcaagaccctagcgggcagggacacaagctgctggat atcgagaggaatacatccgcggaagaagatgcaagcagctggacgtcaagaggacgtcga ggggagcaaaccagtggaagagcacacgacgctggcacaccggcaggccatcagagttcg gccgggaaataccaaagcacaatcaacccatcaaacgaaaaaaacaacaacagcctaaaa gacaaggttgatgtggaggagaaaaaaaagaccagccaacacttagggaaaatagaaaga acctacgttgaaatattggggaatggttcccctgataactttgtgtactcccatggtgtt aatcagggcttatggtattgcactaagcaggatgaaaaatttgtgagaactttgagatca ttttttactgcaatacacaatttattggagagacaacttgctcctgtggagatgattaat gtcacaaggtggattttaagtggatactcactacacaaatgggtaacaggtaggggctgc aacagtttggagggctcagaagaagacaggaagatttgtgaggctttgaatcatcttaga gacttgttgaatggttttggccaaaatgctgatagtgatatagacaatgaaattcaggct gaagtggtctcagatggagatgtgcaacttcttgggaactggagcaaaggtcgctcttgc catgctttagcaaagagactggcggcattttgcccctgccctagagatctgtggaatttt gaacttcagagacataagttagggtgtcgggatgaaagagagagtggagaaggatgtatt cttcgcctgctacatcttgaacacagcaaccacgagcagtcttcatcccatgaccgctgc ccccgccttgtttccacagacccaagctcctgcagtgctcgagtcacatga >gi568815597f:198539269_198856178|GENSCAN_predicted_peptide_6|86_aa MKGEEREVPGSEEAPMVILQLSVPLRGQLSDYKVGLRPLSLLPEAEGEIILKCKSDDVTP LLKASNDPTFTQRKRGSPCNGPPERM >gi568815597f:198539269_198856178|GENSCAN_predicted_CDS_6|261_bp atgaaaggggaagaaagagaagttccagggtctgaagaggcaccaatggtaatacttcag ttgagtgtgccattacgtggtcagctgagtgactataaagttggtctgcgtcctctgtct cttcttcctgaggcagagggagaaatcattttgaaatgtaaatctgatgatgtcactcct ctccttaaagccagtaatgaccccacttttacacagagaaagagaggaagtccctgcaat ggccccccagaacgcatgtga